This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/clang/AST/
-
clang/
-
AST/
-
Expr.h
-
lib/
-
AST/
-
ExprConstant.cpp
-
CodeGen/
4/17
CGExprConstant.cpp
-
test/
-
CodeGen/
-
const-init.c
-
designated-initializers.c
-
union-init2.c
-
CodeGenCXX/
-
cxx11-initializer-aggregate.cpp
-
cxx1z-initializer-aggregate.cpp
-
SemaCXX/
-
large-array-init.cpp

Differential D46241

[CodeGen] Recognize more cases of zero initialization
ClosedPublic

Authored by sepavloff on Apr 29 2018, 10:55 PM.

Download Raw Diff

Details

Reviewers

rjmccall
rsmith

Commits

rG9f8068420a7b: [CodeGen] Recognize more cases of zero initialization
rL332847: [CodeGen] Recognize more cases of zero initialization
rC332847: [CodeGen] Recognize more cases of zero initialization

Summary

If a variable has initializer, codegen tries to build its value. It
causes strange behavior from user viewpoint: compilation of huge zero
initialized arrays like:

int char data_1[2147483648u] = { 0 };

consumes enormous amount of time and memory.

With this change compiler recognizes more patterns when variable is
initialized with zeros and elementwise treatment can be avoided.

Diff Detail

Repository: rC Clang

Event Timeline

sepavloff created this revision.Apr 29 2018, 10:55 PM

dberris added a subscriber: dberris.Apr 29 2018, 11:05 PM

rjmccall added inline comments.Apr 30 2018, 3:26 PM

lib/CodeGen/CGExprConstant.cpp
1395	You should have a comment here clarifying that this is checking whether the initializer is equivalent to a C++ zero initialization, not checking whether the initializer produces a zero bit-pattern.
1414	You should check the array fill expression here; for now, the test case would be something like `StructWithNonTrivialDefaultInit sArr[10] = {};`.
1415	I would suggest doing your primary switch over the form of Init instead of its type, and you can just have an isIntegerConstantExpr check at the end. You should also check for a null pointer expression.

Addressed review notes.

sepavloff marked 3 inline comments as done.May 2 2018, 10:04 AM

sepavloff added inline comments.

lib/CodeGen/CGExprConstant.cpp
1414	Indeed, this is the missing case. Thank you!

rjmccall added inline comments.May 2 2018, 10:36 AM

lib/CodeGen/CGExprConstant.cpp
1403	Do you actually care if it's an array initialization instead of a struct/enum initialization?
1413	Aren't the null-pointer and integer-constant-expression checks below already checking this? Also, `isEvaluatable` actually computes the full value internally (as an `APValue`), so if you're worried about the memory and compile-time effects of producing such a value, you really shouldn't call it. You could reasonably move this entire function to be a method on `Expr` that takes an `ASTContext`.
1417	There's a `isNullPointerConstant` method (you should use `NPC_NeverValueDependent`).

Small simplification

sepavloff marked an inline comment as done.May 3 2018, 10:48 AM

sepavloff added inline comments.

lib/CodeGen/CGExprConstant.cpp
1403	If this code is enabled for for records too, some tests start to fail. For instance, the code: union { int i; double f; } u2 = { }; produces output: %union.anon = type { double } @u2 = global %union.anon zeroinitializer, align 4 while previously it produced: @u2 = global { i32, [4 x i8] } { i32 0, [4 x i8] undef }, align 4 The latter looks more correct.
1413	Comment for `EvaluateAsRValue` says that it tries calculate expression agressively. Indeed, for the code: decltype(nullptr) null(); int *p = null(); compiler ignores potential side effect of `null()` and removes the call, leaving only zero initialization. `isNullPointerConstant` behaves similarly.
1417	It make code more readable. Thank you!

rsmith added inline comments.May 3 2018, 5:17 PM

lib/CodeGen/CGExprConstant.cpp
1413	Nonetheless, it looks like this function could evaluate `Init` up to three times, which seems unreasonable. Instead of the checks based on trying to evaluate the initializer (`isNullPointerConstant` + `isIntegerConstantExpr`), how about calling `VarDecl::evaluateValue()` (which will return a potentially pre-computed and cached initializer value) and checking if the result is a zero constant? In fact, `tryEmitPrivateForVarInit` already does most of that for you, and the right place to make this change is probably in `tryEmitPrivateForMemory`, where you can test to see if the `APValue` is zero-initialized and produce a `zeroinitializer` if so. As a side-benefit, putting the change there will mean we'll also start using `zeroinitializer` for zero-initialized subobjects of objects that have non-zero pieces.

Avoid redundant initializer calculation

Harbormaster completed remote builds in B17773: Diff 145445.May 7 2018, 4:29 AM

sepavloff added inline comments.May 7 2018, 5:31 AM

lib/CodeGen/CGExprConstant.cpp
1413	An important point in this patch is that CodeGen tries to find out, if the initializer can be replaced with zeroinitializer, prior to the evaluation of it. For huge arrays the evaluation consumes huge amount of memory and time and it must be avoided. With this patch CodeGen may evaluate parts of the initializer, if it is represented by `InitListExpr`. It may cause redundant calculation, for instance if the check for zero initialization failed but the initializer is constant. To avoid this redundancy we could cache result of evaluation in instances of `Expr` and then use the partial values in the evaluation of the initializer. The simple use case solved by this patch probably is not a sufficient justification for such redesign.

rjmccall added inline comments.May 7 2018, 10:36 AM

lib/CodeGen/CGExprConstant.cpp
1403	Hmm. In C++, a static object which isn't constant-initialized is zero-initialized, which is required to set any padding bits to zero (N4640 [dcl.init]p6) in both structs and unions. In C, a static object which doesn't have an initializer also has all any padding bits set to zero (N1548 6.7.9p10). Now, this object has an initializer (that acts as a constant-initializer in C++), so those rules don't directly apply; but I would argue that the intent of the standards is not that padding bits become undefined when you happen to have an initializer. So I actually think the `zeroinitializer` emission is more correct.

Added treatment of structures/unions

Harbormaster completed remote builds in B17844: Diff 145735.May 8 2018, 11:07 AM

Hmm. In C++, a static object which isn't constant-initialized is zero-initialized, which is required to set any padding bits to zero (N4640 [dcl.init]p6) in both structs and unions. In C, a static object which doesn't have an initializer also has all any padding bits set to zero (N1548 6.7.9p10). Now, this object has an initializer (that acts as a constant-initializer in C++), so those rules don't directly apply; but I would argue that the intent of the standards is not that padding bits become undefined when you happen to have an initializer. So I actually think the zeroinitializer emission is more correct.

Using undefined values instead of zero initialization was implemented in https://reviews.llvm.org/rL101535. There is no much info about the reason of the implementation. Clang uses undefined values for padding bits, in particular in unions, when the first member is not widest. The code:

    union C1 {
	  char sel;
	  double dval;
	};
	union C1 val_1 = { 0 };

produces:

@val_1 = dso_local global { i8, [7 x i8] } { i8 0, [7 x i8] undef }, align 8

Another case is unnamed bit fields.

	struct C2 {
	  int : 4;
	  int x;
	};
	struct C2 val_2 = { 0 };

produces:

@val_2 = dso_local global { [4 x i8], i32 } { [4 x i8] undef, i32 0 }, align 4

Strictly speaking, this IR does not mean violation of the standard, but it can modify code generation in some cases. If we decided to use zeroinitializer in this case, we probably would need to revise using undefined values in initializers, otherwise similar declarations like:

	union C1 val_1a = { 0 };
	union C1 val_1b = { 1 };

would produce different IR representations, with and without undefined values.

The test SemaCXX/large-array-init.cpp is removed in this change. This test was added in https://reviews.llvm.org/rL325120 to solve https://bugs.llvm.org/show_bug.cgi?id=18978, which describes the same problem, as solved by this patch. This patch presents more efficient solution, with it the tests compiles 50 time faster. If r325120 does not solve additional problems, it can be reverted.

sepavloff added subscribers: nlopes, kosarev.May 8 2018, 11:12 AM

rjmccall added inline comments.May 9 2018, 3:14 PM

lib/CodeGen/CGExprConstant.cpp
1413	I really think you should have some sort of type restriction in front of this so that you don't end up creating a huge APValue only to throw it away because it's not an int or a pointer. It's quite possible to have something like an array initializer in a nested position that's not within an InitListExpr , e.g. due to compound literals or std::initializer_list.

Updated patch

Try evaluating initializer value only if the initializer type is
integral or pointer. It avoids calculation of large value, which
then is discarded.

Thanks, LGTM.

This revision is now accepted and ready to land.May 10 2018, 8:26 AM

rsmith added inline comments.May 10 2018, 2:53 PM

lib/CodeGen/CGExprConstant.cpp
1414–1415	Please call `D.evaluateValue()` here rather than inventing your own evaluation scheme. That way, we'll cache the evaluated initializer on the variable for other uses or reuse the value if we've already evaluated it, and you don't need to worry about the corner cases involved in getting the evaluation right. (As it happens, you're getting some minor details wrong because evaluating an initializer is not quite the same as evaluating an rvalue, but in practice it's not a big deal here.)

sepavloff added inline comments.May 10 2018, 10:01 PM

lib/CodeGen/CGExprConstant.cpp
1414–1415	Call of `D.evaluateValue()` may result in substantial memory and time consumption if the variable value is huge, like in: int char data_1[2147483648u] = { 0 }; The idea of this patch is to recognize some cases of zero initialization prior to the evaluation of variable initializer. In the example above, value would be evaluated only for part of the initializer, namely `0`, which does not have an associated variable, so call of `D.evaluateValue()` is not possible.

@rsmith Do yo think this patch is OK to apply?

Closed by commit rC332847: [CodeGen] Recognize more cases of zero initialization (authored by sepavloff). · Explain WhyMay 21 2018, 9:14 AM

This revision was automatically updated to reflect the committed changes.

rsmith added inline comments.May 21 2018, 2:12 PM

lib/CodeGen/CGExprConstant.cpp
1414–1415	As noted, `EvaluateAsRValue` gets some of the details here wrong. I reverted this in r332886 because of a miscompile due to this fact.

I've done some more investigation, and I now think this patch is merely working around deficiencies elsewhere. See https://reviews.llvm.org/D47166, which aims to fix those deficiencies more directly.

Revision Contents

Path

Size

include/

clang/

AST/

Expr.h

19 lines

lib/

AST/

ExprConstant.cpp

12 lines

CodeGen/

CGExprConstant.cpp

40 lines

test/

CodeGen/

const-init.c

2 lines

designated-initializers.c

2 lines

union-init2.c

2 lines

CodeGenCXX/

cxx11-initializer-aggregate.cpp

27 lines

cxx1z-initializer-aggregate.cpp

4 lines

SemaCXX/

large-array-init.cpp

10 lines

Diff 147798

include/clang/AST/Expr.h

Show First 20 Lines • Show All 531 Lines • ▼ Show 20 Lines	public:

/// isConstantInitializer - Returns true if this expression can be emitted to		/// isConstantInitializer - Returns true if this expression can be emitted to
/// IR as a constant, and thus can be used as a constant initializer in C.		/// IR as a constant, and thus can be used as a constant initializer in C.
/// If this expression is not constant and Culprit is non-null,		/// If this expression is not constant and Culprit is non-null,
/// it is used to store the address of first non constant expr.		/// it is used to store the address of first non constant expr.
bool isConstantInitializer(ASTContext &Ctx, bool ForRef,		bool isConstantInitializer(ASTContext &Ctx, bool ForRef,
const Expr **Culprit = nullptr) const;		const Expr **Culprit = nullptr) const;

		enum SideEffectsKind {
		SE_NoSideEffects, ///< Strictly evaluate the expression.
		SE_AllowUndefinedBehavior, ///< Allow UB that we can give a value, but not
		///< arbitrary unmodeled side effects.
		SE_AllowSideEffects ///< Allow any unmodeled side effect.
		};

/// EvalStatus is a struct with detailed info about an evaluation in progress.		/// EvalStatus is a struct with detailed info about an evaluation in progress.
struct EvalStatus {		struct EvalStatus {
/// Whether the evaluated expression has side effects.		/// Whether the evaluated expression has side effects.
/// For example, (f() && 0) can be folded, but it still has side effects.		/// For example, (f() && 0) can be folded, but it still has side effects.
bool HasSideEffects;		bool HasSideEffects;

/// Whether the evaluation hit undefined behavior.		/// Whether the evaluation hit undefined behavior.
/// For example, 1.0 / 0.0 can be folded to Inf, but has undefined behavior.		/// For example, 1.0 / 0.0 can be folded to Inf, but has undefined behavior.
Show All 12 Lines	struct EvalStatus {
EvalStatus()		EvalStatus()
: HasSideEffects(false), HasUndefinedBehavior(false), Diag(nullptr) {}		: HasSideEffects(false), HasUndefinedBehavior(false), Diag(nullptr) {}

// hasSideEffects - Return true if the evaluated expression has		// hasSideEffects - Return true if the evaluated expression has
// side effects.		// side effects.
bool hasSideEffects() const {		bool hasSideEffects() const {
return HasSideEffects;		return HasSideEffects;
}		}

		bool hasUnacceptableSideEffect(SideEffectsKind SEK) {
		return (SEK < SE_AllowSideEffects && HasSideEffects) \|\|
		(SEK < SE_AllowUndefinedBehavior && HasUndefinedBehavior);
		}
};		};

/// EvalResult is a struct with detailed info about an evaluated expression.		/// EvalResult is a struct with detailed info about an evaluated expression.
struct EvalResult : EvalStatus {		struct EvalResult : EvalStatus {
/// Val - This is the value the expression can be folded to.		/// Val - This is the value the expression can be folded to.
APValue Val;		APValue Val;

// isGlobalLValue - Return true if the evaluated lvalue expression		// isGlobalLValue - Return true if the evaluated lvalue expression
Show All 10 Lines	public:
bool EvaluateAsRValue(EvalResult &Result, const ASTContext &Ctx) const;		bool EvaluateAsRValue(EvalResult &Result, const ASTContext &Ctx) const;

/// EvaluateAsBooleanCondition - Return true if this is a constant		/// EvaluateAsBooleanCondition - Return true if this is a constant
/// which we can fold and convert to a boolean condition using		/// which we can fold and convert to a boolean condition using
/// any crazy technique that we want to, even if the expression has		/// any crazy technique that we want to, even if the expression has
/// side-effects.		/// side-effects.
bool EvaluateAsBooleanCondition(bool &Result, const ASTContext &Ctx) const;		bool EvaluateAsBooleanCondition(bool &Result, const ASTContext &Ctx) const;

enum SideEffectsKind {
SE_NoSideEffects, ///< Strictly evaluate the expression.
SE_AllowUndefinedBehavior, ///< Allow UB that we can give a value, but not
///< arbitrary unmodeled side effects.
SE_AllowSideEffects ///< Allow any unmodeled side effect.
};

/// EvaluateAsInt - Return true if this is a constant which we can fold and		/// EvaluateAsInt - Return true if this is a constant which we can fold and
/// convert to an integer, using any crazy technique that we want to.		/// convert to an integer, using any crazy technique that we want to.
bool EvaluateAsInt(llvm::APSInt &Result, const ASTContext &Ctx,		bool EvaluateAsInt(llvm::APSInt &Result, const ASTContext &Ctx,
SideEffectsKind AllowSideEffects = SE_NoSideEffects) const;		SideEffectsKind AllowSideEffects = SE_NoSideEffects) const;

/// EvaluateAsFloat - Return true if this is a constant which we can fold and		/// EvaluateAsFloat - Return true if this is a constant which we can fold and
/// convert to a floating point value, using any crazy technique that we		/// convert to a floating point value, using any crazy technique that we
/// want to.		/// want to.
▲ Show 20 Lines • Show All 4,683 Lines • Show Last 20 Lines

lib/AST/ExprConstant.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 10,306 Lines • ▼ Show 20 Lines

bool Expr::EvaluateAsBooleanCondition(bool &Result,		bool Expr::EvaluateAsBooleanCondition(bool &Result,
const ASTContext &Ctx) const {		const ASTContext &Ctx) const {
EvalResult Scratch;		EvalResult Scratch;
return EvaluateAsRValue(Scratch, Ctx) &&		return EvaluateAsRValue(Scratch, Ctx) &&
HandleConversionToBool(Scratch.Val, Result);		HandleConversionToBool(Scratch.Val, Result);
}		}

static bool hasUnacceptableSideEffect(Expr::EvalStatus &Result,
Expr::SideEffectsKind SEK) {
return (SEK < Expr::SE_AllowSideEffects && Result.HasSideEffects) \|\|
(SEK < Expr::SE_AllowUndefinedBehavior && Result.HasUndefinedBehavior);
}

bool Expr::EvaluateAsInt(APSInt &Result, const ASTContext &Ctx,		bool Expr::EvaluateAsInt(APSInt &Result, const ASTContext &Ctx,
SideEffectsKind AllowSideEffects) const {		SideEffectsKind AllowSideEffects) const {
if (!getType()->isIntegralOrEnumerationType())		if (!getType()->isIntegralOrEnumerationType())
return false;		return false;

EvalResult ExprResult;		EvalResult ExprResult;
if (!EvaluateAsRValue(ExprResult, Ctx) \|\| !ExprResult.Val.isInt() \|\|		if (!EvaluateAsRValue(ExprResult, Ctx) \|\| !ExprResult.Val.isInt() \|\|
hasUnacceptableSideEffect(ExprResult, AllowSideEffects))		ExprResult.hasUnacceptableSideEffect(AllowSideEffects))
return false;		return false;

Result = ExprResult.Val.getInt();		Result = ExprResult.Val.getInt();
return true;		return true;
}		}

bool Expr::EvaluateAsFloat(APFloat &Result, const ASTContext &Ctx,		bool Expr::EvaluateAsFloat(APFloat &Result, const ASTContext &Ctx,
SideEffectsKind AllowSideEffects) const {		SideEffectsKind AllowSideEffects) const {
if (!getType()->isRealFloatingType())		if (!getType()->isRealFloatingType())
return false;		return false;

EvalResult ExprResult;		EvalResult ExprResult;
if (!EvaluateAsRValue(ExprResult, Ctx) \|\| !ExprResult.Val.isFloat() \|\|		if (!EvaluateAsRValue(ExprResult, Ctx) \|\| !ExprResult.Val.isFloat() \|\|
hasUnacceptableSideEffect(ExprResult, AllowSideEffects))		ExprResult.hasUnacceptableSideEffect(AllowSideEffects))
return false;		return false;

Result = ExprResult.Val.getFloat();		Result = ExprResult.Val.getFloat();
return true;		return true;
}		}

bool Expr::EvaluateAsLValue(EvalResult &Result, const ASTContext &Ctx) const {		bool Expr::EvaluateAsLValue(EvalResult &Result, const ASTContext &Ctx) const {
EvalInfo Info(Ctx, Result, EvalInfo::EM_ConstantFold);		EvalInfo Info(Ctx, Result, EvalInfo::EM_ConstantFold);
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	return CheckConstantExpression(InitInfo, VD->getLocation(), VD->getType(),
Value);		Value);
}		}

/// isEvaluatable - Call EvaluateAsRValue to see if this expression can be		/// isEvaluatable - Call EvaluateAsRValue to see if this expression can be
/// constant folded, but discard the result.		/// constant folded, but discard the result.
bool Expr::isEvaluatable(const ASTContext &Ctx, SideEffectsKind SEK) const {		bool Expr::isEvaluatable(const ASTContext &Ctx, SideEffectsKind SEK) const {
EvalResult Result;		EvalResult Result;
return EvaluateAsRValue(Result, Ctx) &&		return EvaluateAsRValue(Result, Ctx) &&
!hasUnacceptableSideEffect(Result, SEK);		!Result.hasUnacceptableSideEffect(SEK);
}		}

APSInt Expr::EvaluateKnownConstInt(const ASTContext &Ctx,		APSInt Expr::EvaluateKnownConstInt(const ASTContext &Ctx,
SmallVectorImpl<PartialDiagnosticAt> *Diag) const {		SmallVectorImpl<PartialDiagnosticAt> *Diag) const {
EvalResult EvalResult;		EvalResult EvalResult;
EvalResult.Diag = Diag;		EvalResult.Diag = Diag;
bool Result = EvaluateAsRValue(EvalResult, Ctx);		bool Result = EvaluateAsRValue(EvalResult, Ctx);
(void)Result;		(void)Result;
▲ Show 20 Lines • Show All 651 Lines • Show Last 20 Lines

lib/CodeGen/CGExprConstant.cpp

	Show First 20 Lines • Show All 1,386 Lines • ▼ Show 20 Lines
	static QualType getNonMemoryType(CodeGenModule &CGM, QualType type) {			static QualType getNonMemoryType(CodeGenModule &CGM, QualType type) {
	if (auto AT = type->getAs<AtomicType>()) {			if (auto AT = type->getAs<AtomicType>()) {
	return CGM.getContext().getQualifiedType(AT->getValueType(),			return CGM.getContext().getQualifiedType(AT->getValueType(),
	type.getQualifiers());			type.getQualifiers());
	}			}
	return type;			return type;
	}			}

				/// Checks if the specified initializer is equivalent to zero initialization.
				rjmccallUnsubmitted Done Reply Inline Actions You should have a comment here clarifying that this is checking whether the initializer is equivalent to a C++ zero initialization, not checking whether the initializer produces a zero bit-pattern. rjmccall: You should have a comment here clarifying that this is checking whether the initializer is…
				static bool isZeroInitializer(ConstantEmitter &CE, const Expr *Init) {
				if (auto *E = dyn_cast_or_null<CXXConstructExpr>(Init)) {
				CXXConstructorDecl *CD = E->getConstructor();
				return CD->isDefaultConstructor() && CD->isTrivial();
				}

				if (auto *IL = dyn_cast_or_null<InitListExpr>(Init)) {
				for (auto I : IL->inits())
				rjmccallUnsubmitted Not Done Reply Inline Actions Do you actually care if it's an array initialization instead of a struct/enum initialization? rjmccall: Do you actually care if it's an array initialization instead of a struct/enum initialization?
				sepavloffAuthorUnsubmitted Not Done Reply Inline Actions If this code is enabled for for records too, some tests start to fail. For instance, the code: union { int i; double f; } u2 = { }; produces output: %union.anon = type { double } @u2 = global %union.anon zeroinitializer, align 4 while previously it produced: @u2 = global { i32, [4 x i8] } { i32 0, [4 x i8] undef }, align 4 The latter looks more correct. sepavloff: If this code is enabled for for records too, some tests start to fail. For instance, the code…
				rjmccallUnsubmitted Not Done Reply Inline Actions Hmm. In C++, a static object which isn't constant-initialized is zero-initialized, which is required to set any padding bits to zero (N4640 [dcl.init]p6) in both structs and unions. In C, a static object which doesn't have an initializer also has all any padding bits set to zero (N1548 6.7.9p10). Now, this object has an initializer (that acts as a constant-initializer in C++), so those rules don't directly apply; but I would argue that the intent of the standards is not that padding bits become undefined when you happen to have an initializer. So I actually think the `zeroinitializer` emission is more correct. rjmccall: Hmm. In C++, a static object which isn't constant-initialized is zero-initialized, which is…
				if (!isZeroInitializer(CE, I))
				return false;
				if (const Expr *Filler = IL->getArrayFiller())
				return isZeroInitializer(CE, Filler);
				return true;
				}

				QualType InitTy = Init->getType();
				if (InitTy->isIntegralOrEnumerationType() \|\| InitTy->isPointerType()) {
				Expr::EvalResult Result;
				rjmccallUnsubmitted Not Done Reply Inline Actions Aren't the null-pointer and integer-constant-expression checks below already checking this? Also, `isEvaluatable` actually computes the full value internally (as an `APValue`), so if you're worried about the memory and compile-time effects of producing such a value, you really shouldn't call it. You could reasonably move this entire function to be a method on `Expr` that takes an `ASTContext`. rjmccall: Aren't the null-pointer and integer-constant-expression checks below already checking this?
				sepavloffAuthorUnsubmitted Not Done Reply Inline Actions Comment for `EvaluateAsRValue` says that it tries calculate expression agressively. Indeed, for the code: decltype(nullptr) null(); int p = null(); compiler ignores potential side effect of `null()` and removes the call, leaving only zero initialization. `isNullPointerConstant` behaves similarly. sepavloff:* Comment for `EvaluateAsRValue` says that it tries calculate expression agressively. Indeed, for…
				rsmithUnsubmitted Not Done Reply Inline Actions Nonetheless, it looks like this function could evaluate `Init` up to three times, which seems unreasonable. Instead of the checks based on trying to evaluate the initializer (`isNullPointerConstant` + `isIntegerConstantExpr`), how about calling `VarDecl::evaluateValue()` (which will return a potentially pre-computed and cached initializer value) and checking if the result is a zero constant? In fact, `tryEmitPrivateForVarInit` already does most of that for you, and the right place to make this change is probably in `tryEmitPrivateForMemory`, where you can test to see if the `APValue` is zero-initialized and produce a `zeroinitializer` if so. As a side-benefit, putting the change there will mean we'll also start using `zeroinitializer` for zero-initialized subobjects of objects that have non-zero pieces. rsmith: Nonetheless, it looks like this function could evaluate `Init` up to three times, which seems…
				sepavloffAuthorUnsubmitted Not Done Reply Inline Actions An important point in this patch is that CodeGen tries to find out, if the initializer can be replaced with zeroinitializer, prior to the evaluation of it. For huge arrays the evaluation consumes huge amount of memory and time and it must be avoided. With this patch CodeGen may evaluate parts of the initializer, if it is represented by `InitListExpr`. It may cause redundant calculation, for instance if the check for zero initialization failed but the initializer is constant. To avoid this redundancy we could cache result of evaluation in instances of `Expr` and then use the partial values in the evaluation of the initializer. The simple use case solved by this patch probably is not a sufficient justification for such redesign. sepavloff: An important point in this patch is that CodeGen tries to find out, if the initializer can be…
				rjmccallUnsubmitted Not Done Reply Inline Actions I really think you should have some sort of type restriction in front of this so that you don't end up creating a huge APValue only to throw it away because it's not an int or a pointer. It's quite possible to have something like an array initializer in a nested position that's not within an InitListExpr , e.g. due to compound literals or std::initializer_list. rjmccall: I really think you should have some sort of type restriction in front of this so that you don't…
				if (Init->EvaluateAsRValue(Result, CE.CGM.getContext()) &&
				rjmccallUnsubmitted Done Reply Inline Actions You should check the array fill expression here; for now, the test case would be something like `StructWithNonTrivialDefaultInit sArr[10] = {};`. rjmccall: You should check the array fill expression here; for now, the test case would be something like…
				sepavloffAuthorUnsubmitted Not Done Reply Inline Actions Indeed, this is the missing case. Thank you! sepavloff: Indeed, this is the missing case. Thank you!
				!Result.hasUnacceptableSideEffect(Expr::SE_NoSideEffects))
				rjmccallUnsubmitted Done Reply Inline Actions I would suggest doing your primary switch over the form of Init instead of its type, and you can just have an isIntegerConstantExpr check at the end. You should also check for a null pointer expression. rjmccall: I would suggest doing your primary switch over the form of Init instead of its type, and you…
				rsmithUnsubmitted Not Done Reply Inline Actions Please call `D.evaluateValue()` here rather than inventing your own evaluation scheme. That way, we'll cache the evaluated initializer on the variable for other uses or reuse the value if we've already evaluated it, and you don't need to worry about the corner cases involved in getting the evaluation right. (As it happens, you're getting some minor details wrong because evaluating an initializer is not quite the same as evaluating an rvalue, but in practice it's not a big deal here.) rsmith: Please call `D.evaluateValue()` here rather than inventing your own evaluation scheme. That way…
				sepavloffAuthorUnsubmitted Not Done Reply Inline Actions Call of `D.evaluateValue()` may result in substantial memory and time consumption if the variable value is huge, like in: int char data_1[2147483648u] = { 0 }; The idea of this patch is to recognize some cases of zero initialization prior to the evaluation of variable initializer. In the example above, value would be evaluated only for part of the initializer, namely `0`, which does not have an associated variable, so call of `D.evaluateValue()` is not possible. sepavloff: Call of `D.evaluateValue()` may result in substantial memory and time consumption if the…
				rsmithUnsubmitted Not Done Reply Inline Actions As noted, `EvaluateAsRValue` gets some of the details here wrong. I reverted this in r332886 because of a miscompile due to this fact. rsmith: As noted, `EvaluateAsRValue` gets some of the details here wrong. I reverted this in r332886…
				return (Result.Val.isInt() && Result.Val.getInt().isNullValue()) \|\|
				(Result.Val.isLValue() && Result.Val.isNullPointer());
				rjmccallUnsubmitted Done Reply Inline Actions There's a `isNullPointerConstant` method (you should use `NPC_NeverValueDependent`). rjmccall: There's a `isNullPointerConstant` method (you should use `NPC_NeverValueDependent`).
				sepavloffAuthorUnsubmitted Not Done Reply Inline Actions It make code more readable. Thank you! sepavloff: It make code more readable. Thank you!
				}

				return false;
				}

	llvm::Constant *ConstantEmitter::tryEmitPrivateForVarInit(const VarDecl &D) {			llvm::Constant *ConstantEmitter::tryEmitPrivateForVarInit(const VarDecl &D) {
	// Make a quick check if variable can be default NULL initialized			// Make a quick check if variable can be default NULL initialized
	// and avoid going through rest of code which may do, for c++11,			// and avoid going through rest of code which may do, for c++11,
	// initialization of memory to all NULLs.			// initialization of memory to all NULLs.
	if (!D.hasLocalStorage()) {			if (!D.hasLocalStorage() && isZeroInitializer(*this, D.getInit()))
	QualType Ty = CGM.getContext().getBaseElementType(D.getType());
	if (Ty->isRecordType())
	if (const CXXConstructExpr *E =
	dyn_cast_or_null<CXXConstructExpr>(D.getInit())) {
	const CXXConstructorDecl *CD = E->getConstructor();
	if (CD->isTrivial() && CD->isDefaultConstructor())
	return CGM.EmitNullConstant(D.getType());			return CGM.EmitNullConstant(D.getType());
	}
	}

	QualType destType = D.getType();			QualType destType = D.getType();

	// Try to emit the initializer. Note that this can allow some things that			// Try to emit the initializer. Note that this can allow some things that
	// are not allowed by tryEmitPrivateForMemory alone.			// are not allowed by tryEmitPrivateForMemory alone.
	if (auto value = D.evaluateValue()) {			if (auto value = D.evaluateValue()) {
	return tryEmitPrivateForMemory(*value, destType);			return tryEmitPrivateForMemory(*value, destType);
	}			}
	▲ Show 20 Lines • Show All 715 Lines • Show Last 20 Lines

test/CodeGen/const-init.c

	Show First 20 Lines • Show All 161 Lines • ▼ Show 20 Lines

	// PR21300			// PR21300
	void g30() {			void g30() {
	#pragma pack(1)			#pragma pack(1)
	static struct {			static struct {
	int : 1;			int : 1;
	int x;			int x;
	} a = {};			} a = {};
	// CHECK: @g30.a = internal global %struct.anon.1 <{ i8 undef, i32 0 }>, align 1			// CHECK: @g30.a = internal global %struct.anon.1 zeroinitializer, align 1
	#pragma pack()			#pragma pack()
	}			}

	void g31() {			void g31() {
	#pragma pack(4)			#pragma pack(4)
	static struct {			static struct {
	short a;			short a;
	long x;			long x;
	short z;			short z;
	} a = {23122, -12312731, -312};			} a = {23122, -12312731, -312};
	#pragma pack()			#pragma pack()
	// CHECK: @g31.a = internal global %struct.anon.2 { i16 23122, i32 -12312731, i16 -312 }, align 4			// CHECK: @g31.a = internal global %struct.anon.2 { i16 23122, i32 -12312731, i16 -312 }, align 4
	}			}

test/CodeGen/designated-initializers.c

	// RUN: %clang_cc1 -triple i386-unknown-unknown %s -emit-llvm -o - \| FileCheck %s			// RUN: %clang_cc1 -triple i386-unknown-unknown %s -emit-llvm -o - \| FileCheck %s

	struct foo {			struct foo {
	void *a;			void *a;
	int b;			int b;
	};			};

	// CHECK: @u = global %union.anon zeroinitializer			// CHECK: @u = global %union.anon zeroinitializer
	union { int i; float f; } u = { };			union { int i; float f; } u = { };

	// CHECK: @u2 = global { i32, [4 x i8] } { i32 0, [4 x i8] undef }			// CHECK: @u2 = global %union.anon.0 zeroinitializer
	union { int i; double f; } u2 = { };			union { int i; double f; } u2 = { };

	// CHECK: @u3 = global %union.anon.1 zeroinitializer			// CHECK: @u3 = global %union.anon.1 zeroinitializer
	union { double f; int i; } u3 = { };			union { double f; int i; } u3 = { };

	// CHECK: @b = global [2 x i32] [i32 0, i32 22]			// CHECK: @b = global [2 x i32] [i32 0, i32 22]
	int b[2] = {			int b[2] = {
	[1] = 22			[1] = 22
	▲ Show 20 Lines • Show All 160 Lines • Show Last 20 Lines

test/CodeGen/union-init2.c

	// RUN: %clang_cc1 -emit-llvm %s -o - -triple i686-pc-linux-gnu \| FileCheck %s			// RUN: %clang_cc1 -emit-llvm %s -o - -triple i686-pc-linux-gnu \| FileCheck %s

	// Make sure we generate something sane instead of a ptrtoint			// Make sure we generate something sane instead of a ptrtoint
	// CHECK: bitcast ({ %union.x, [4 x i8] } @r to %union.x*), [4 x i8] undef			// CHECK: bitcast ({ %union.x, [4 x i8] } @r to %union.x*), [4 x i8] undef
	union x {long long b;union x* a;} r = {.a = &r};			union x {long long b;union x* a;} r = {.a = &r};


	// CHECK: global { [3 x i8], [5 x i8] } { [3 x i8] zeroinitializer, [5 x i8] undef }			// CHECK: global %union.z zeroinitializer
	union z {			union z {
	char a[3];			char a[3];
	long long b;			long long b;
	};			};
	union z y = {};			union z y = {};

test/CodeGenCXX/cxx11-initializer-aggregate.cpp

	Show First 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
	// CHECK: call i32 @__cxa_atexit(			// CHECK: call i32 @__cxa_atexit(
	namespace NonTrivialInit {			namespace NonTrivialInit {
	struct A { A(); A(const A&) = delete; ~A(); };			struct A { A(); A(const A&) = delete; ~A(); };
	struct B { A a[20]; };			struct B { A a[20]; };
	// NB, this must be large enough to be worth memsetting for this test to be			// NB, this must be large enough to be worth memsetting for this test to be
	// meaningful.			// meaningful.
	B b[30] = {};			B b[30] = {};
	}			}

				namespace ZeroInit {
				enum { Zero, One };
				constexpr int zero() { return 0; }
				constexpr int *null() { return nullptr; }
				struct Filler {
				int x;
				Filler();
				};
				struct S1 {
				int x;
				};

				// These declarations, if implemented elementwise, require huge
				// amout of memory and compiler time.
				unsigned char data_1[1024 * 1024 * 1024 * 2u] = { 0 };
				unsigned char data_2[1024 * 1024 * 1024 * 2u] = { Zero };
				unsigned char data_3[1024][1024][1024] = {{{0}}};
				unsigned char data_4[1024 * 1024 * 1024 * 2u] = { zero() };
				int data_5[1024 1024 * 512] = { nullptr };
				int data_6[1024 1024 * 512] = { null() };
				struct S1 data_7[1024 * 1024 * 512] = {{0}};

				// This variable must be initialized elementwise.
				Filler data_e1[1024] = {};
				// CHECK: getelementptr inbounds {{.*}} @_ZN8ZeroInit7data_e1E
				}

test/CodeGenCXX/cxx1z-initializer-aggregate.cpp

Show All 11 Lines	namespace Constant {
};		};

struct C : B {};		struct C : B {};

struct D : A, C {};		struct D : A, C {};

C c1 = {};		C c1 = {};
C c2 = {1};		C c2 = {1};
// CHECK: @_ZN8Constant2c1E = global { i8 } zeroinitializer, align 1		// CHECK: @_ZN8Constant2c1E = global %"struct.Constant::C" zeroinitializer, align 1
// CHECK: @_ZN8Constant2c2E = global { i8 } { i8 1 }, align 1		// CHECK: @_ZN8Constant2c2E = global { i8 } { i8 1 }, align 1

// Test packing bases into tail padding.		// Test packing bases into tail padding.
D d1 = {};		D d1 = {};
D d2 = {1, 2, 3};		D d2 = {1, 2, 3};
D d3 = {1};		D d3 = {1};
// CHECK: @_ZN8Constant2d1E = global { i32, i8, i8 } zeroinitializer, align 4		// CHECK: @_ZN8Constant2d1E = global %"struct.Constant::D" zeroinitializer, align 4
// CHECK: @_ZN8Constant2d2E = global { i32, i8, i8 } { i32 1, i8 2, i8 3 }, align 4		// CHECK: @_ZN8Constant2d2E = global { i32, i8, i8 } { i32 1, i8 2, i8 3 }, align 4
// CHECK: @_ZN8Constant2d3E = global { i32, i8, i8 } { i32 1, i8 0, i8 0 }, align 4		// CHECK: @_ZN8Constant2d3E = global { i32, i8, i8 } { i32 1, i8 0, i8 0 }, align 4

// CHECK-LABEL: define {{.*}}global_var_init		// CHECK-LABEL: define {{.*}}global_var_init
// CHECK: call {{.}} @__cxa_atexit({{.}} @_ZN8Constant1DD1Ev {{.*}} @_ZN8Constant2d1E		// CHECK: call {{.}} @__cxa_atexit({{.}} @_ZN8Constant1DD1Ev {{.*}} @_ZN8Constant2d1E

// CHECK-LABEL: define {{.*}}global_var_init		// CHECK-LABEL: define {{.*}}global_var_init
// CHECK: call {{.}} @__cxa_atexit({{.}} @_ZN8Constant1DD1Ev {{.*}} @_ZN8Constant2d2E		// CHECK: call {{.}} @__cxa_atexit({{.}} @_ZN8Constant1DD1Ev {{.*}} @_ZN8Constant2d2E
▲ Show 20 Lines • Show All 97 Lines • Show Last 20 Lines

test/SemaCXX/large-array-init.cpp

	// RUN: %clang_cc1 -S -o %t.ll -mllvm -debug-only=exprconstant %s 2>&1 \| \
	// RUN: FileCheck %s
	// REQUIRES: asserts

	struct S { int i; };

	static struct S arr[100000000] = {{ 0 }};
	// CHECK: The number of elements to initialize: 1.

	struct S *foo() { return arr; }

This is an archive of the discontinued LLVM Phabricator instance.

[CodeGen] Recognize more cases of zero initializationClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 147798

include/clang/AST/Expr.h

lib/AST/ExprConstant.cpp

lib/CodeGen/CGExprConstant.cpp

test/CodeGen/const-init.c

test/CodeGen/designated-initializers.c

test/CodeGen/union-init2.c

test/CodeGenCXX/cxx11-initializer-aggregate.cpp

test/CodeGenCXX/cxx1z-initializer-aggregate.cpp

test/SemaCXX/large-array-init.cpp

[CodeGen] Recognize more cases of zero initialization
ClosedPublic