This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/CodeGen/
-
CodeGen/
-
CGDeclCXX.cpp
-
CGExpr.cpp
-
CGExprScalar.cpp
-
CodeGenFunction.h
-
test/CodeGenCXX/
-
CodeGenCXX/
-
invariant.cpp

Differential D75285

Mark restrict pointer or reference to const as invariant
AbandonedPublic

Authored by yaxunl on Feb 27 2020, 11:41 AM.

Download Raw Diff

Details

Reviewers

rjmccall
jeroen.dobbelaere
hfinkel
jdoerfert

Summary

We saw users intend to use const int* restrict to indicate the memory pointed to by the pointer is invariant.

This makes sense since restrict means the memory is not aliased by any other pointers whereas const
means the memory does not change.

Mark such pointer or reference as invariant allows more optimization opportunities.

This gives users a way to mark something as invariant.

Diff Detail

Event Timeline

yaxunl created this revision.Feb 27 2020, 11:41 AM

lebedev.ri added reviewers: jeroen.dobbelaere, hfinkel, jdoerfert.Feb 27 2020, 11:50 AM

Unfortunately, we cannot do this kind of thing just because it seems to make sense. The language semantics must be exactly satisfied by the IR-level semantics. I certainly agree that it would make sense for users to be able to mark invariant loads, but this mechanism simply might not be the right one.

One problem here is that, with something like:

char test2(X *x) {
  const char* __restrict p = &(x->b);
  return *p;
}

what happens when the function is inlined? Does the "invariantness" only still apply to accesses within the scope of the local restrict pointer? I believe that it would not, and that would be a problem because later code might legally modify the relevant data.

This revision now requires changes to proceed.Feb 27 2020, 12:00 PM

Unfortunately, const also doesn't mean that the memory doesn't change. It does mean it can't be changed through this pointer, but restrict allows you to derive more pointers from it within the restrict scope, and those pointers can remove the const qualifier.

If this is not the right way to tell the compiler a memory pointed to by a pointer is invariant, what is the recommended way?

Can we introduce clang builtins for llvm.invariant.start and llvm.invariant.end to allow user to specify that?

Thanks.

Are you sure restrict alone isn't good enough? It doesn't directly tell you that the memory is invariant, but it's usually simple to prove that the memory isn't modified within the restrict scope, which might be sufficient.

In D75285#1896400, @hfinkel wrote:
Unfortunately, we cannot do this kind of thing just because it seems to make sense. The language semantics must be exactly satisfied by the IR-level semantics. I certainly agree that it would make sense for users to be able to mark invariant loads, but this mechanism simply might not be the right one.

One problem here is that, with something like:
char test2(X *x) {
  const char* __restrict p = &(x->b);
  return *p;
}
what happens when the function is inlined? Does the "invariantness" only still apply to accesses within the scope of the local restrict pointer? I believe that it would not, and that would be a problem because later code might legally modify the relevant data.

How about inserting llvm.invariant.end at the end of scope of the variable?

In D75285#1896458, @rjmccall wrote:

Unfortunately, const also doesn't mean that the memory doesn't change. It does mean it can't be changed through this pointer, but restrict allows you to derive more pointers from it within the restrict scope, and those pointers can remove the const qualifier.

If users derive a non-const pointer from the const pointer and modify it, doesn't that result in UB? Thanks.

I don't think that 'restrict' is a good match for this behavior. For c++, the alias_set proposal (http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2014/n4150.pdf) would be a better match.
You would put the read access of *p in its own universe; or even better, something like

struct X {
  int a;
  char [[alias_set(MyOwnUniverseForX_b)]]  b;
};

Unfortunatly, there is no implementation yet.

Imho, adding a 'attribute((invariant))' or something similar (immutable ? const_invariant ?) would be a better approach. (hmm, I like 'immutable')

char test2(X *x) {
  const char __attribute__((immutable))  *p = (const char __attribute__((immutable))*)&(x->b);
  // for all i:  p[i] will never be modified.
  return *p;
}

Extra precautions are probably needed to ensure that the initialization of x->b is separated from the usage of it.

In D75285#1897247, @yaxunl wrote:

If users derive a non-const pointer from the const pointer and modify it, doesn't that result in UB? Thanks.

No. Modifying a const object is UB, so e.g. we can segv if it's in .rodata, but a const pointer is not necessarily a pointer to a const object. If it's a const pointer to a non-const object then one can cast it directly to a non-const pointer and mutate at will.

This unfortunately makes 'const int*' of rather less use than it would otherwise be.

In D75285#1897537, @JonChesterfield wrote:

In D75285#1897247, @yaxunl wrote:

If users derive a non-const pointer from the const pointer and modify it, doesn't that result in UB? Thanks.

No. Modifying a const object is UB, so e.g. we can segv if it's in .rodata, but a const pointer is not necessarily a pointer to a const object. If it's a const pointer to a non-const object then one can cast it directly to a non-const pointer and mutate at will.

This unfortunately makes 'const int*' of rather less use than it would otherwise be.

Right. Note that this UB extends to all const objects, even locals, which is something that I don't think we currently have a good way to take advantage of in LLVM.

Unfortunately, this probably doesn't help Yaxun's use case.

In D75285#1897502, @jeroen.dobbelaere wrote:

I don't think that 'restrict' is a good match for this behavior. For c++, the alias_set proposal (http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2014/n4150.pdf) would be a better match.

Oh yes. Thanks for the reminder. We have only 18 months left if we want something like that in C++23...

In D75285#1897502, @jeroen.dobbelaere wrote:
I don't think that 'restrict' is a good match for this behavior. For c++, the alias_set proposal (http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2014/n4150.pdf) would be a better match.
You would put the read access of *p in its own universe; or even better, something like
struct X {
  int a;
  char [[alias_set(MyOwnUniverseForX_b)]]  b;
};
Unfortunatly, there is no implementation yet.

Imho, adding a 'attribute((invariant))' or something similar (immutable ? const_invariant ?) would be a better approach. (hmm, I like 'immutable')
char test2(X *x) {
  const char __attribute__((immutable))  *p = (const char __attribute__((immutable))*)&(x->b);
  // for all i:  p[i] will never be modified.
  return *p;
}
Extra precautions are probably needed to ensure that the initialization of x->b is separated from the usage of it.

Agree. An attribute like __attribute__((immutable)) should be useful. It can be used on a pointer or reference and tells the compiler that
the content of the pointer or reference is invariant in the scope of that variable.

Anastasia added a subscriber: Anastasia.Mar 3 2020, 4:24 AM

In D75285#1896610, @rjmccall wrote:

Are you sure restrict alone isn't good enough? It doesn't directly tell you that the memory is invariant, but it's usually simple to prove that the memory isn't modified within the restrict scope, which might be sufficient.

Do you mean to prove in analysis passes? Should we emit some sort of hints from the frontend to indicate what to look for?

In D75285#1902788, @Anastasia wrote:

In D75285#1896610, @rjmccall wrote:

Are you sure restrict alone isn't good enough? It doesn't directly tell you that the memory is invariant, but it's usually simple to prove that the memory isn't modified within the restrict scope, which might be sufficient.

Do you mean to prove in analysis passes? Should we emit some sort of hints from the frontend to indicate what to look for?

Not sure what you mean with 'hints from the frontend', but D68484 (and later) contain a significant improvement to clang's handling of restrict. That could make the restrict path feasible (if that would support the actual use case).

In D75285#1902835, @jeroen.dobbelaere wrote:

In D75285#1902788, @Anastasia wrote:

In D75285#1896610, @rjmccall wrote:

Are you sure restrict alone isn't good enough? It doesn't directly tell you that the memory is invariant, but it's usually simple to prove that the memory isn't modified within the restrict scope, which might be sufficient.

Do you mean to prove in analysis passes? Should we emit some sort of hints from the frontend to indicate what to look for?

Not sure what you mean with 'hints from the frontend', but D68484 (and later) contain a significant improvement to clang's handling of restrict. That could make the restrict path feasible (if that would support the actual use case).

I think there are cases that noalias is not sufficient to prove invariance. For example, a global variable, even if we mark it as restrict and we do not modify it in a function, the compiler is still not sure it is invariant in that function, since it may be modified by another thread. In this case, if a user knows that it is invariant in that function, he would just want to mark it as __invariant__ or __immutable__.

In D75285#1903284, @yaxunl wrote:

In D75285#1902835, @jeroen.dobbelaere wrote:

In D75285#1902788, @Anastasia wrote:

In D75285#1896610, @rjmccall wrote:

Are you sure restrict alone isn't good enough? It doesn't directly tell you that the memory is invariant, but it's usually simple to prove that the memory isn't modified within the restrict scope, which might be sufficient.

Do you mean to prove in analysis passes? Should we emit some sort of hints from the frontend to indicate what to look for?

Not sure what you mean with 'hints from the frontend', but D68484 (and later) contain a significant improvement to clang's handling of restrict. That could make the restrict path feasible (if that would support the actual use case).

I think there are cases that noalias is not sufficient to prove invariance. For example, a global variable, even if we mark it as restrict and we do not modify it in a function, the compiler is still not sure it is invariant in that function, since it may be modified by another thread. In this case, if a user knows that it is invariant in that function, he would just want to mark it as __invariant__ or __immutable__.

That is not true for two reasons: first, restrict guarantees that the variable is not accessed through any non-derived l-value within its scope, and that would certainly include from other threads; and second, it is undefined behavior for two threads to access the same object without synchronizing anyway (unless they're both just reading from it).

In D75285#1903444, @rjmccall wrote:

That is not true for two reasons: first, restrict guarantees that the variable is not accessed through any non-derived l-value within its scope, and that would certainly include from other threads; and second, it is undefined behavior for two threads to access the same object without synchronizing anyway (unless they're both just reading from it).

How about the cases where users cannot use restrict but they still want to mark a pointer as invariant? Or even though restrict is used but it is too complicated for alias analysis to deduce invariance?

In D75285#1903611, @yaxunl wrote:

In D75285#1903444, @rjmccall wrote:

That is not true for two reasons: first, restrict guarantees that the variable is not accessed through any non-derived l-value within its scope, and that would certainly include from other threads; and second, it is undefined behavior for two threads to access the same object without synchronizing anyway (unless they're both just reading from it).

How about the cases where users cannot use restrict but they still want to mark a pointer as invariant?

I'm not sure what cases those would be; I'm pretty sure that if memory is invariant then you can always use restrict.

Or even though restrict is used but it is too complicated for alias analysis to deduce invariance?

I asked before if there was a specific optimization problem you were trying to solve, and I still have that question. It kindof feels like somebody's already decided that they don't want to use alias analysis for something, so now you're looking for ways to do it without alias analysis, even though alias analysis might be a satisfactory way of solving the problem. restrict gives us a *lot* of informatiion; I'm sure there are places where we don't preserve it well enough to do some optimization, but that can be improved without needing a whole new language feature.

In D75285#1903611, @yaxunl wrote:

In D75285#1903444, @rjmccall wrote:

That is not true for two reasons: first, restrict guarantees that the variable is not accessed through any non-derived l-value within its scope, and that would certainly include from other threads; and second, it is undefined behavior for two threads to access the same object without synchronizing anyway (unless they're both just reading from it).

How about the cases where users cannot use restrict but they still want to mark a pointer as invariant? Or even though restrict is used but it is too complicated for alias analysis to deduce invariance?

If we can reuse existing attributes it is better than adding new ones so my preference would be to make restrict work unless it's absolutely impossible for the use case you consider.

yaxunl abandoned this revision.Mar 30 2020, 8:58 AM

Revision Contents

Path

Size

clang/

lib/

CodeGen/

10 lines

157 lines

14 lines

2 lines

test/

CodeGenCXX/

invariant.cpp

28 lines

Diff 247047

clang/lib/CodeGen/CGDeclCXX.cpp

	Show First 20 Lines • Show All 147 Lines • ▼ Show 20 Lines
	/// Emit code to cause the variable at the given address to be considered as			/// Emit code to cause the variable at the given address to be considered as
	/// constant from this point onwards.			/// constant from this point onwards.
	static void EmitDeclInvariant(CodeGenFunction &CGF, const VarDecl &D,			static void EmitDeclInvariant(CodeGenFunction &CGF, const VarDecl &D,
	llvm::Constant *Addr) {			llvm::Constant *Addr) {
	return CGF.EmitInvariantStart(			return CGF.EmitInvariantStart(
	Addr, CGF.getContext().getTypeSizeInChars(D.getType()));			Addr, CGF.getContext().getTypeSizeInChars(D.getType()));
	}			}

	void CodeGenFunction::EmitInvariantStart(llvm::Constant *Addr, CharUnits Size) {			void CodeGenFunction::EmitInvariantStart(llvm::Value *Addr, CharUnits Size) {
	// Do not emit the intrinsic if we're not optimizing.			// Do not emit the intrinsic if we're not optimizing.
	if (!CGM.getCodeGenOpts().OptimizationLevel)			if (!CGM.getCodeGenOpts().OptimizationLevel)
	return;			return;

	// Grab the llvm.invariant.start intrinsic.			// Grab the llvm.invariant.start intrinsic.
	llvm::Intrinsic::ID InvStartID = llvm::Intrinsic::invariant_start;			llvm::Intrinsic::ID InvStartID = llvm::Intrinsic::invariant_start;
	// Overloaded address space type.			// Overloaded address space type.
	llvm::Type *ObjectPtr[1] = {Int8PtrTy};			llvm::Type *ObjectPtr[1] = {Int8PtrTy};
	llvm::Function *InvariantStart = CGM.getIntrinsic(InvStartID, ObjectPtr);			llvm::Function *InvariantStart = CGM.getIntrinsic(InvStartID, ObjectPtr);

	// Emit a call with the size in bytes of the object.			// Emit a call with the size in bytes of the object.
	uint64_t Width = Size.getQuantity();			uint64_t Width = Size.getQuantity();
	llvm::Value *Args[2] = { llvm::ConstantInt::getSigned(Int64Ty, Width),			llvm::Value *Cast;
	llvm::ConstantExpr::getBitCast(Addr, Int8PtrTy)};			if (llvm::Constant *C = dyn_cast<llvm::Constant>(Addr))
				Cast = llvm::ConstantExpr::getBitCast(C, Int8PtrTy);
				else
				Cast = Builder.CreateBitCast(Addr, Int8PtrTy);
				llvm::Value *Args[2] = {llvm::ConstantInt::getSigned(Int64Ty, Width), Cast};
	Builder.CreateCall(InvariantStart, Args);			Builder.CreateCall(InvariantStart, Args);
	}			}

	void CodeGenFunction::EmitCXXGlobalVarDeclInit(const VarDecl &D,			void CodeGenFunction::EmitCXXGlobalVarDeclInit(const VarDecl &D,
	llvm::Constant *DeclPtr,			llvm::Constant *DeclPtr,
	bool PerformInit) {			bool PerformInit) {

	const Expr *Init = D.getInit();			const Expr *Init = D.getInit();
	▲ Show 20 Lines • Show All 604 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGExpr.cpp

	Show First 20 Lines • Show All 1,246 Lines • ▼ Show 20 Lines
	///			///
	/// If this returns a normal address, and if the lvalue's C type is fixed size,			/// If this returns a normal address, and if the lvalue's C type is fixed size,
	/// this method guarantees that the returned pointer type will point to an LLVM			/// this method guarantees that the returned pointer type will point to an LLVM
	/// type of the same size of the lvalue's type. If the lvalue has a variable			/// type of the same size of the lvalue's type. If the lvalue has a variable
	/// length type, this is not possible.			/// length type, this is not possible.
	///			///
	LValue CodeGenFunction::EmitLValue(const Expr *E) {			LValue CodeGenFunction::EmitLValue(const Expr *E) {
	ApplyDebugLocation DL(*this, E);			ApplyDebugLocation DL(*this, E);
				LValue Ret;
	switch (E->getStmtClass()) {			switch (E->getStmtClass()) {
	default: return EmitUnsupportedLValue(E, "l-value expression");			default:
				Ret = EmitUnsupportedLValue(E, "l-value expression");
				break;

	case Expr::ObjCPropertyRefExprClass:			case Expr::ObjCPropertyRefExprClass:
	llvm_unreachable("cannot emit a property reference directly");			llvm_unreachable("cannot emit a property reference directly");
				break;

	case Expr::ObjCSelectorExprClass:			case Expr::ObjCSelectorExprClass:
	return EmitObjCSelectorLValue(cast<ObjCSelectorExpr>(E));			Ret = EmitObjCSelectorLValue(cast<ObjCSelectorExpr>(E));
				break;
	case Expr::ObjCIsaExprClass:			case Expr::ObjCIsaExprClass:
	return EmitObjCIsaExpr(cast<ObjCIsaExpr>(E));			Ret = EmitObjCIsaExpr(cast<ObjCIsaExpr>(E));
				break;
	case Expr::BinaryOperatorClass:			case Expr::BinaryOperatorClass:
	return EmitBinaryOperatorLValue(cast<BinaryOperator>(E));			Ret = EmitBinaryOperatorLValue(cast<BinaryOperator>(E));
				break;
	case Expr::CompoundAssignOperatorClass: {			case Expr::CompoundAssignOperatorClass: {
	QualType Ty = E->getType();			QualType Ty = E->getType();
	if (const AtomicType *AT = Ty->getAs<AtomicType>())			if (const AtomicType *AT = Ty->getAs<AtomicType>())
	Ty = AT->getValueType();			Ty = AT->getValueType();
	if (!Ty->isAnyComplexType())			if (!Ty->isAnyComplexType())
	return EmitCompoundAssignmentLValue(cast<CompoundAssignOperator>(E));			Ret = EmitCompoundAssignmentLValue(cast<CompoundAssignOperator>(E));
	return EmitComplexCompoundAssignmentLValue(cast<CompoundAssignOperator>(E));			else
				Ret =
				EmitComplexCompoundAssignmentLValue(cast<CompoundAssignOperator>(E));
				break;
	}			}
	case Expr::CallExprClass:			case Expr::CallExprClass:
	case Expr::CXXMemberCallExprClass:			case Expr::CXXMemberCallExprClass:
	case Expr::CXXOperatorCallExprClass:			case Expr::CXXOperatorCallExprClass:
	case Expr::UserDefinedLiteralClass:			case Expr::UserDefinedLiteralClass:
	return EmitCallExprLValue(cast<CallExpr>(E));			Ret = EmitCallExprLValue(cast<CallExpr>(E));
				break;
	case Expr::CXXRewrittenBinaryOperatorClass:			case Expr::CXXRewrittenBinaryOperatorClass:
	return EmitLValue(cast<CXXRewrittenBinaryOperator>(E)->getSemanticForm());			Ret = EmitLValue(cast<CXXRewrittenBinaryOperator>(E)->getSemanticForm());
				break;
	case Expr::VAArgExprClass:			case Expr::VAArgExprClass:
	return EmitVAArgExprLValue(cast<VAArgExpr>(E));			Ret = EmitVAArgExprLValue(cast<VAArgExpr>(E));
				break;
	case Expr::DeclRefExprClass:			case Expr::DeclRefExprClass:
	return EmitDeclRefLValue(cast<DeclRefExpr>(E));			Ret = EmitDeclRefLValue(cast<DeclRefExpr>(E));
				break;
	case Expr::ConstantExprClass:			case Expr::ConstantExprClass:
	return EmitLValue(cast<ConstantExpr>(E)->getSubExpr());			Ret = EmitLValue(cast<ConstantExpr>(E)->getSubExpr());
				break;
	case Expr::ParenExprClass:			case Expr::ParenExprClass:
	return EmitLValue(cast<ParenExpr>(E)->getSubExpr());			Ret = EmitLValue(cast<ParenExpr>(E)->getSubExpr());
				break;
	case Expr::GenericSelectionExprClass:			case Expr::GenericSelectionExprClass:
	return EmitLValue(cast<GenericSelectionExpr>(E)->getResultExpr());			Ret = EmitLValue(cast<GenericSelectionExpr>(E)->getResultExpr());
				break;
	case Expr::PredefinedExprClass:			case Expr::PredefinedExprClass:
	return EmitPredefinedLValue(cast<PredefinedExpr>(E));			Ret = EmitPredefinedLValue(cast<PredefinedExpr>(E));
				break;
	case Expr::StringLiteralClass:			case Expr::StringLiteralClass:
	return EmitStringLiteralLValue(cast<StringLiteral>(E));			Ret = EmitStringLiteralLValue(cast<StringLiteral>(E));
				break;
	case Expr::ObjCEncodeExprClass:			case Expr::ObjCEncodeExprClass:
	return EmitObjCEncodeExprLValue(cast<ObjCEncodeExpr>(E));			Ret = EmitObjCEncodeExprLValue(cast<ObjCEncodeExpr>(E));
				break;
	case Expr::PseudoObjectExprClass:			case Expr::PseudoObjectExprClass:
	return EmitPseudoObjectLValue(cast<PseudoObjectExpr>(E));			Ret = EmitPseudoObjectLValue(cast<PseudoObjectExpr>(E));
				break;
	case Expr::InitListExprClass:			case Expr::InitListExprClass:
	return EmitInitListLValue(cast<InitListExpr>(E));			Ret = EmitInitListLValue(cast<InitListExpr>(E));
				break;
	case Expr::CXXTemporaryObjectExprClass:			case Expr::CXXTemporaryObjectExprClass:
	case Expr::CXXConstructExprClass:			case Expr::CXXConstructExprClass:
	return EmitCXXConstructLValue(cast<CXXConstructExpr>(E));			Ret = EmitCXXConstructLValue(cast<CXXConstructExpr>(E));
				break;
	case Expr::CXXBindTemporaryExprClass:			case Expr::CXXBindTemporaryExprClass:
	return EmitCXXBindTemporaryLValue(cast<CXXBindTemporaryExpr>(E));			Ret = EmitCXXBindTemporaryLValue(cast<CXXBindTemporaryExpr>(E));
				break;
	case Expr::CXXUuidofExprClass:			case Expr::CXXUuidofExprClass:
	return EmitCXXUuidofLValue(cast<CXXUuidofExpr>(E));			Ret = EmitCXXUuidofLValue(cast<CXXUuidofExpr>(E));
				break;
	case Expr::LambdaExprClass:			case Expr::LambdaExprClass:
	return EmitAggExprToLValue(E);			Ret = EmitAggExprToLValue(E);
				break;

	case Expr::ExprWithCleanupsClass: {			case Expr::ExprWithCleanupsClass: {
	const auto *cleanups = cast<ExprWithCleanups>(E);			const auto *cleanups = cast<ExprWithCleanups>(E);
	enterFullExpression(cleanups);			enterFullExpression(cleanups);
	RunCleanupsScope Scope(*this);			RunCleanupsScope Scope(*this);
	LValue LV = EmitLValue(cleanups->getSubExpr());			LValue LV = EmitLValue(cleanups->getSubExpr());
	if (LV.isSimple()) {			if (LV.isSimple()) {
	// Defend against branches out of gnu statement expressions surrounded by			// Defend against branches out of gnu statement expressions surrounded by
	// cleanups.			// cleanups.
	llvm::Value V = LV.getPointer(this);			llvm::Value V = LV.getPointer(this);
	Scope.ForceCleanup({&V});			Scope.ForceCleanup({&V});
	return LValue::MakeAddr(Address(V, LV.getAlignment()), LV.getType(),			Ret = LValue::MakeAddr(Address(V, LV.getAlignment()), LV.getType(),
	getContext(), LV.getBaseInfo(), LV.getTBAAInfo());			getContext(), LV.getBaseInfo(), LV.getTBAAInfo());
	}			}
	// FIXME: Is it possible to create an ExprWithCleanups that produces a			// FIXME: Is it possible to create an ExprWithCleanups that produces a
	// bitfield lvalue or some other non-simple lvalue?			// bitfield lvalue or some other non-simple lvalue?
	return LV;			else
				Ret = LV;
				break;
	}			}

	case Expr::CXXDefaultArgExprClass: {			case Expr::CXXDefaultArgExprClass: {
	auto *DAE = cast<CXXDefaultArgExpr>(E);			auto *DAE = cast<CXXDefaultArgExpr>(E);
	CXXDefaultArgExprScope Scope(*this, DAE);			CXXDefaultArgExprScope Scope(*this, DAE);
	return EmitLValue(DAE->getExpr());			Ret = EmitLValue(DAE->getExpr());
				break;
	}			}
	case Expr::CXXDefaultInitExprClass: {			case Expr::CXXDefaultInitExprClass: {
	auto *DIE = cast<CXXDefaultInitExpr>(E);			auto *DIE = cast<CXXDefaultInitExpr>(E);
	CXXDefaultInitExprScope Scope(*this, DIE);			CXXDefaultInitExprScope Scope(*this, DIE);
	return EmitLValue(DIE->getExpr());			Ret = EmitLValue(DIE->getExpr());
				break;
	}			}
	case Expr::CXXTypeidExprClass:			case Expr::CXXTypeidExprClass:
	return EmitCXXTypeidLValue(cast<CXXTypeidExpr>(E));			Ret = EmitCXXTypeidLValue(cast<CXXTypeidExpr>(E));
				break;

	case Expr::ObjCMessageExprClass:			case Expr::ObjCMessageExprClass:
	return EmitObjCMessageExprLValue(cast<ObjCMessageExpr>(E));			Ret = EmitObjCMessageExprLValue(cast<ObjCMessageExpr>(E));
				break;
	case Expr::ObjCIvarRefExprClass:			case Expr::ObjCIvarRefExprClass:
	return EmitObjCIvarRefLValue(cast<ObjCIvarRefExpr>(E));			Ret = EmitObjCIvarRefLValue(cast<ObjCIvarRefExpr>(E));
				break;
	case Expr::StmtExprClass:			case Expr::StmtExprClass:
	return EmitStmtExprLValue(cast<StmtExpr>(E));			Ret = EmitStmtExprLValue(cast<StmtExpr>(E));
				break;
	case Expr::UnaryOperatorClass:			case Expr::UnaryOperatorClass:
	return EmitUnaryOpLValue(cast<UnaryOperator>(E));			Ret = EmitUnaryOpLValue(cast<UnaryOperator>(E));
				break;
	case Expr::ArraySubscriptExprClass:			case Expr::ArraySubscriptExprClass:
	return EmitArraySubscriptExpr(cast<ArraySubscriptExpr>(E));			Ret = EmitArraySubscriptExpr(cast<ArraySubscriptExpr>(E));
				break;
	case Expr::OMPArraySectionExprClass:			case Expr::OMPArraySectionExprClass:
	return EmitOMPArraySectionExpr(cast<OMPArraySectionExpr>(E));			Ret = EmitOMPArraySectionExpr(cast<OMPArraySectionExpr>(E));
				break;
	case Expr::ExtVectorElementExprClass:			case Expr::ExtVectorElementExprClass:
	return EmitExtVectorElementExpr(cast<ExtVectorElementExpr>(E));			Ret = EmitExtVectorElementExpr(cast<ExtVectorElementExpr>(E));
				break;
	case Expr::MemberExprClass:			case Expr::MemberExprClass:
	return EmitMemberExpr(cast<MemberExpr>(E));			Ret = EmitMemberExpr(cast<MemberExpr>(E));
				break;
	case Expr::CompoundLiteralExprClass:			case Expr::CompoundLiteralExprClass:
	return EmitCompoundLiteralLValue(cast<CompoundLiteralExpr>(E));			Ret = EmitCompoundLiteralLValue(cast<CompoundLiteralExpr>(E));
				break;
	case Expr::ConditionalOperatorClass:			case Expr::ConditionalOperatorClass:
	return EmitConditionalOperatorLValue(cast<ConditionalOperator>(E));			Ret = EmitConditionalOperatorLValue(cast<ConditionalOperator>(E));
				break;
	case Expr::BinaryConditionalOperatorClass:			case Expr::BinaryConditionalOperatorClass:
	return EmitConditionalOperatorLValue(cast<BinaryConditionalOperator>(E));			Ret = EmitConditionalOperatorLValue(cast<BinaryConditionalOperator>(E));
				break;
	case Expr::ChooseExprClass:			case Expr::ChooseExprClass:
	return EmitLValue(cast<ChooseExpr>(E)->getChosenSubExpr());			Ret = EmitLValue(cast<ChooseExpr>(E)->getChosenSubExpr());
				break;
	case Expr::OpaqueValueExprClass:			case Expr::OpaqueValueExprClass:
	return EmitOpaqueValueLValue(cast<OpaqueValueExpr>(E));			Ret = EmitOpaqueValueLValue(cast<OpaqueValueExpr>(E));
				break;
	case Expr::SubstNonTypeTemplateParmExprClass:			case Expr::SubstNonTypeTemplateParmExprClass:
	return EmitLValue(cast<SubstNonTypeTemplateParmExpr>(E)->getReplacement());			Ret = EmitLValue(cast<SubstNonTypeTemplateParmExpr>(E)->getReplacement());
				break;
	case Expr::ImplicitCastExprClass:			case Expr::ImplicitCastExprClass:
	case Expr::CStyleCastExprClass:			case Expr::CStyleCastExprClass:
	case Expr::CXXFunctionalCastExprClass:			case Expr::CXXFunctionalCastExprClass:
	case Expr::CXXStaticCastExprClass:			case Expr::CXXStaticCastExprClass:
	case Expr::CXXDynamicCastExprClass:			case Expr::CXXDynamicCastExprClass:
	case Expr::CXXReinterpretCastExprClass:			case Expr::CXXReinterpretCastExprClass:
	case Expr::CXXConstCastExprClass:			case Expr::CXXConstCastExprClass:
	case Expr::ObjCBridgedCastExprClass:			case Expr::ObjCBridgedCastExprClass:
	return EmitCastLValue(cast<CastExpr>(E));			Ret = EmitCastLValue(cast<CastExpr>(E));
				break;

	case Expr::MaterializeTemporaryExprClass:			case Expr::MaterializeTemporaryExprClass:
	return EmitMaterializeTemporaryExpr(cast<MaterializeTemporaryExpr>(E));			Ret = EmitMaterializeTemporaryExpr(cast<MaterializeTemporaryExpr>(E));
				break;

	case Expr::CoawaitExprClass:			case Expr::CoawaitExprClass:
	return EmitCoawaitLValue(cast<CoawaitExpr>(E));			Ret = EmitCoawaitLValue(cast<CoawaitExpr>(E));
				break;
	case Expr::CoyieldExprClass:			case Expr::CoyieldExprClass:
	return EmitCoyieldLValue(cast<CoyieldExpr>(E));			Ret = EmitCoyieldLValue(cast<CoyieldExpr>(E));
				break;
	}			}

				// Mark a restrict reference to const as invariant.
				// ToDo: Now we only handle DeclRefExpr. We should handle more cases later.
				if (const auto *DRE = dyn_cast<DeclRefExpr>(E)) {
				const ValueDecl *VD = DRE->getDecl();
				auto QT = VD->getType();
				if (QT->isReferenceType()) {
				auto PointeeTy = QT->getPointeeType();
				if (PointeeTy.isConstQualified() && QT.isRestrictQualified()) {
				EmitInvariantStart(Ret.getPointer(*this),
				getContext().getTypeSizeInChars(PointeeTy));
				}
				}
				}

				return Ret;
	}			}

	/// Given an object of the given canonical type, can we safely copy a			/// Given an object of the given canonical type, can we safely copy a
	/// value out of it based on its initializer?			/// value out of it based on its initializer?
	static bool isConstantEmittableObjectType(QualType type) {			static bool isConstantEmittableObjectType(QualType type) {
	assert(type.isCanonical());			assert(type.isCanonical());
	assert(!type->isReferenceType());			assert(!type->isReferenceType());

	▲ Show 20 Lines • Show All 3,800 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGExprScalar.cpp

Show First 20 Lines • Show All 302 Lines • ▼ Show 20 Lines	public:
/// EmitLoadOfLValue - Given an expression with complex type that represents a		/// EmitLoadOfLValue - Given an expression with complex type that represents a
/// value l-value, this method emits the address of the l-value, then loads		/// value l-value, this method emits the address of the l-value, then loads
/// and returns the result.		/// and returns the result.
Value EmitLoadOfLValue(const Expr E) {		Value EmitLoadOfLValue(const Expr E) {
Value *V = EmitLoadOfLValue(EmitCheckedLValue(E, CodeGenFunction::TCK_Load),		Value *V = EmitLoadOfLValue(EmitCheckedLValue(E, CodeGenFunction::TCK_Load),
E->getExprLoc());		E->getExprLoc());

EmitLValueAlignmentAssumption(E, V);		EmitLValueAlignmentAssumption(E, V);

		// Mark a restrict pointer or to const as invariant.
		if (const auto *DRE = dyn_cast<DeclRefExpr>(E)) {
		const ValueDecl *VD = DRE->getDecl();
		auto QT = VD->getType();
		if (QT->isPointerType()) {
		auto PointeeTy = QT->getPointeeType();
		if (PointeeTy.isConstQualified() && QT.isRestrictQualified()) {
		CGF.EmitInvariantStart(
		V, CGF.getContext().getTypeSizeInChars(PointeeTy));
		}
		}
		}

return V;		return V;
}		}

/// EmitConversionToBool - Convert the specified expression value to a		/// EmitConversionToBool - Convert the specified expression value to a
/// boolean (i1) truth value. This is equivalent to "Val != 0".		/// boolean (i1) truth value. This is equivalent to "Val != 0".
Value EmitConversionToBool(Value Src, QualType DstTy);		Value EmitConversionToBool(Value Src, QualType DstTy);

/// Emit a check that a conversion from a floating-point type does not		/// Emit a check that a conversion from a floating-point type does not
▲ Show 20 Lines • Show All 4,575 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.h

Show First 20 Lines • Show All 4,064 Lines • ▼ Show 20 Lines	public:
/// global variable that has already been created for it. If the initializer		/// global variable that has already been created for it. If the initializer
/// has a different type than GV does, this may free GV and return a different		/// has a different type than GV does, this may free GV and return a different
/// one. Otherwise it just returns GV.		/// one. Otherwise it just returns GV.
llvm::GlobalVariable *		llvm::GlobalVariable *
AddInitializerToStaticVarDecl(const VarDecl &D,		AddInitializerToStaticVarDecl(const VarDecl &D,
llvm::GlobalVariable *GV);		llvm::GlobalVariable *GV);

// Emit an @llvm.invariant.start call for the given memory region.		// Emit an @llvm.invariant.start call for the given memory region.
void EmitInvariantStart(llvm::Constant *Addr, CharUnits Size);		void EmitInvariantStart(llvm::Value *Addr, CharUnits Size);

/// EmitCXXGlobalVarDeclInit - Create the initializer for a C++		/// EmitCXXGlobalVarDeclInit - Create the initializer for a C++
/// variable with global storage.		/// variable with global storage.
void EmitCXXGlobalVarDeclInit(const VarDecl &D, llvm::Constant *DeclPtr,		void EmitCXXGlobalVarDeclInit(const VarDecl &D, llvm::Constant *DeclPtr,
bool PerformInit);		bool PerformInit);

llvm::Function *createAtExitStub(const VarDecl &VD, llvm::FunctionCallee Dtor,		llvm::Function *createAtExitStub(const VarDecl &VD, llvm::FunctionCallee Dtor,
llvm::Constant *Addr);		llvm::Constant *Addr);
▲ Show 20 Lines • Show All 453 Lines • Show Last 20 Lines

clang/test/CodeGenCXX/invariant.cpp

This file was added.

				// RUN: %clang_cc1 -O3 -emit-llvm -o - %s \| FileCheck %s

				typedef struct {
				int a;
				char b;
				} X;

				const int* __restrict p;

				// CHECK-LABEL: test1
				// CHECK: llvm.invariant.start
				int test1() {
				return *p;
				}

				// CHECK-LABEL: test2
				// CHECK: llvm.invariant.start
				char test2(X *x) {
				const char* __restrict p = &(x->b);
				return *p;
				}

				// CHECK-LABEL: test3
				// CHECK: llvm.invariant.start
				char test3(X &x) {
				const char& __restrict p = x.b;
				return p;
				}