This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/Basic/
-
clang/
-
Basic/
-
Builtins.def
-
lib/
-
CodeGen/
5/6
CGBuiltin.cpp
-
Sema/
-
SemaChecking.cpp
-
test/
-
CodeGenCXX/
8/8
builtin-zero-non-value-bits-codegen.cpp
1/4
builtin-zero-non-value-bits.cpp
-
SemaCXX/
1/1
builtin-zero-non-value-bits.cpp

Differential D87974

[Builtin] Add __builtin_zero_non_value_bits.
Needs ReviewPublic

Authored by zoecarver on Sep 19 2020, 1:25 PM.

Download Raw Diff

Details

Reviewers

jfb
rsmith
Bigcheese
__simt__

Summary

Adds __builtin_zero_non_value_bits to zero all padding bits of a struct. This builtin should match the behavior of those in NVCC and GCC (and MSVC?). There are some tests in this patch but hopefully we'll also get tests from other compilers (so all builtins can be as similar as possible).

I'm planning to add support for unions, bitfields (both as members and members of sub-objects), and booleans as follow up patches.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

zoecarver created this revision.Sep 19 2020, 1:25 PM

Herald added a project: Restricted Project. · View Herald TranscriptSep 19 2020, 1:25 PM

Herald added a subscriber: cfe-commits. · View Herald Transcript

zoecarver requested review of this revision.Sep 19 2020, 1:25 PM

zoecarver edited the summary of this revision. (Show Details)Sep 19 2020, 1:32 PM

zoecarver added reviewers: jfb, rsmith, Bigcheese, __simt__.

Herald added a subscriber: dexonsmith. · View Herald TranscriptSep 19 2020, 1:32 PM

tschuett added a subscriber: tschuett.Sep 19 2020, 1:42 PM

Harbormaster completed remote builds in B72289: Diff 292987.Sep 19 2020, 2:12 PM

jfb added inline comments.Sep 19 2020, 4:14 PM

clang/lib/CodeGen/CGBuiltin.cpp
1682	Typo in "fields".
clang/test/CodeGenCXX/builtin-zero-non-value-bits.cpp
161	Usually CodeGen tests will use lit to check the emitted IR matches expectations. I think that's what you want to do here. Remember to test `volatile` qualified pointers, as well as address spaces too.
clang/test/SemaCXX/builtin-zero-non-value-bits.cpp
12	You should also check incomplete types, vector, variable width integers, `const` qualified.

zoecarver added inline comments.Sep 20 2020, 1:35 PM

clang/test/CodeGenCXX/builtin-zero-non-value-bits.cpp
161	What's a good place for me to put this end-to-end test?

zoecarver retitled this revision from Summary: [Builtin] Add __builtin_zero_non_value_bits. to [Builtin] Add __builtin_zero_non_value_bits..Sep 22 2020, 9:32 AM

Add more test cases.
Fix typo.
Add codegen tests.

zoecarver marked 2 inline comments as done and an inline comment as not done.Sep 22 2020, 5:48 PM

Harbormaster completed remote builds in B72602: Diff 293606.Sep 22 2020, 6:45 PM

jfb added inline comments.Sep 23 2020, 10:10 AM

clang/lib/CodeGen/CGBuiltin.cpp
1647	I'd like to hear @rsmith's thoughts on this approach, in particular w.r.t. aliasing concerns. I also wonder if below the GEPs should be inbounds, depending on how they're created.
1652	You should use `alignmentAtOffset` here.
clang/test/CodeGenCXX/builtin-zero-non-value-bits-codegen.cpp
17	It would be helpful to have a comment with the final layout of the struct, including padding. Give each padding field a name, and reference them in the IR check below.
27	It would help read the tests if you had a comment on top of each store, for example here "padding byte X".
47	It would be useful to see a test for arrays with a type that contains tail padding.
clang/test/CodeGenCXX/builtin-zero-non-value-bits.cpp
161	I'm not sure, I don't usually add this type of test :)

zoecarver marked 3 inline comments as done.Sep 27 2020, 10:29 PM

zoecarver added inline comments.

clang/test/CodeGenCXX/builtin-zero-non-value-bits-codegen.cpp
17	Done. I've named each padding field as "PAD_X" so below it should be clear what fields are being stored without a comment.
27	See above comment.
47	Hmm, this test case doesn't seem to be working. I'll investigate further.
clang/test/CodeGenCXX/builtin-zero-non-value-bits.cpp
161	Even if it's a bit unconventional, I think it would be good to have this type of test. I think we should try to cover as many test cases as possible because it's important that this builtin both doesn't zero non-padding bits and does zero all padding bits. And it wouldn't be practical to add the 100+ test cases covered here as codegen tests.

Add UnsizedTail codegen test

Harbormaster completed remote builds in B73120: Diff 294603.Sep 27 2020, 11:05 PM

zoecarver marked an inline comment as done.Sep 30 2020, 5:57 PM

zoecarver added inline comments.

clang/test/CodeGenCXX/builtin-zero-non-value-bits-codegen.cpp
47	OK, I've added that. Just to clarify, you mean a type that contains a constant array type of types with tail padding (i.e., `Bar [2]`)?

Support constant arrays
Format changes with clang-format

zoecarver added inline comments.Sep 30 2020, 6:00 PM

clang/lib/CodeGen/CGBuiltin.cpp
1652	I'm using `CharUnits::One().alignmentAtOffset` to here because this type will always have a size of 1 because it's an `i8` ptr.

zoecarver added inline comments.Sep 30 2020, 6:08 PM

clang/lib/CodeGen/CGBuiltin.cpp
1704	Is it OK to possibly create hundreds of stores here? I assume later optimizations will catch this and turn it into a loop or a call to memset or something. But this could potentially be harmful to code size.

Harbormaster completed remote builds in B73596: Diff 295445.Sep 30 2020, 6:35 PM

@jfb (and others), friendly ping. Are there any other changes (especially to the logic) that you'd like me to make?

clang/lib/CodeGen/CGBuiltin.cpp
1647	Hmm, I don't know much about it but, I think these could be inbound. Because we will never actually go beyond the size of the llvm type.
clang/test/CodeGenCXX/builtin-zero-non-value-bits-codegen.cpp
47	Let me know if there are more codegen tests you'd like me to add. Happy to add them. (Maybe one for vector types, or volatile, or one that doesn't have an non-value bits?)

dexonsmith removed a subscriber: dexonsmith.Oct 19 2020, 5:41 PM

As of a few hours ago, GCC has __builtin_clear_padding, see https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html#index-_005f_005fbuiltin_005fclear_005fpadding for the docs.

In D87974#2408119, @jwakely wrote:

As of a few hours ago, GCC has __builtin_clear_padding, see https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html#index-_005f_005fbuiltin_005fclear_005fpadding for the docs.

Great. Then I'll update this to be named __builtin_clear_padding.

Let's make sure that we follow the same semantics that GCC does, particularly w.r.t. union, bitfields, and padding at the end of an object (whether it's in an array or not).

Let's make sure that we follow the same semantics that GCC does, particularly w.r.t. union, bitfields, and padding at the end of an object (whether it's in an array or not).

Agreed. I'm planning to run some tests tomorrow once the nightly build has updated.

I've resumed looking at the library code.

How should I check for support? Is it going to be e.g. __has_feature(__builtin_clear_padding)?

How should I check for support? Is it going to be e.g. has_feature(builtin_clear_padding)?

I'm not sure __has_feature will work but __has_builtin should work on both Clang and GCC.

@jwakely It looks like UnsizedTail causes a crash.

Other than that all the tests in this PR pass. It also looks like there's (at least) some support for unions and bitfields. This patch doesn't support those but I'm planning to add support as a follow-up.

In D87974#2409829, @zoecarver wrote:

@jwakely It looks like UnsizedTail causes a crash.

Filed https://gcc.gnu.org/PR97943 for that. Avoiding the crash is trivial, deciding what we want to do exactly for flexible array members (which are beyond the C++ standard) is harder.

Filed https://gcc.gnu.org/PR97943 for that. Avoiding the crash is trivial, deciding what we want to do exactly for flexible array members (which are beyond the C++ standard) is harder.

Yes, and we should try to do the same thing in this case. Currently, this implementation clears any padding bits that might come before the flexible array member but doesn't attempt to clear any of the array's padding bits (which I'm pretty sure wouldn't be feasible). So, what we're really deciding is whether or not to error for these types.

So, it looks like GCC already uses __builtin_clear_padding and MSVC already uses __builtin_zero_non_value_bits. This patch (obviously) is currently implementing __builtin_zero_non_value_bits but, I had planned to update it to use __builtin_clear_padding. Maybe that's not the best course of action, though.

We should all try to agree on _one_ name. CC @BillyONeal @jwakely thoughts?

In D87974#2432793, @zoecarver wrote:

So, it looks like GCC already uses __builtin_clear_padding and MSVC already uses __builtin_zero_non_value_bits. This patch (obviously) is currently implementing __builtin_zero_non_value_bits but, I had planned to update it to use __builtin_clear_padding. Maybe that's not the best course of action, though.

We should all try to agree on _one_ name. CC @BillyONeal @jwakely thoughts?

The name MSFT is already shipping in production is __builtin_zero_non_value_bits. If gcc is already shipping another name in production I think clang is stuck supporting both names, if gcc has not yet shipped their implementation perhaps we can choose one. That seems to be more on gcc than it is on clang given clang's desire to be more or less a drop in replacement for either gcc or msvc.

The MSFT STL implementation can of course use a different builtin when we detect clang.

In D87974#2436169, @BillyONeal wrote:

The name MSFT is already shipping in production is __builtin_zero_non_value_bits. If gcc is already shipping another name in production I think clang is stuck supporting both names, if gcc has not yet shipped their implementation perhaps we can choose one. That seems to be more on gcc than it is on clang given clang's desire to be more or less a drop in replacement for either gcc or msvc.

Are they actually the same, with the same handling of corner cases like unions and tail padding?

There's more to this than just the name, and if they aren't the same, it seems better to have two names.

Is there a specification for __builtin_zero_non_value_bits available somewhere?

Are they actually the same, with the same handling of corner cases like unions and tail padding?
There's more to this than just the name, and if they aren't the same, it seems better to have two names.

They are both implementing the same C++ feature, with the same desired semantics of zeroing out any bits in the object representation that aren't in the value representation. If they differ, one or the other would have a bug.

Is there a specification for __builtin_zero_non_value_bits available somewhere?

I don't know if there is a formal spec for it beyond the actual C++ standard.

In D87974#2438682, @BillyONeal wrote:

Are they actually the same, with the same handling of corner cases like unions and tail padding?
There's more to this than just the name, and if they aren't the same, it seems better to have two names.

They are both implementing the same C++ feature, with the same desired semantics of zeroing out any bits in the object representation that aren't in the value representation. If they differ, one or the other would have a bug.

I agree, they either need to be identical (including corner cases) or there needs to be two of them (i.e., GCC ships both __builtin_zero_non_value_bits and __builtin_clear_padding and the first is the same as MSVC, Clang, and NVCC).

Is there a specification for __builtin_zero_non_value_bits available somewhere?

I don't know if there is a formal spec for it beyond the actual C++ standard.

I think P0528 is the relevant paper but other than that, no, there's not a spec. I think that's going to be the most time sensitive part of implementing this: coming up with the spec and making sure all the tests pass on all the implementations.

I think Jonathan is asking whether there is a match in the gray areas.

The two cases people bring up most:

Unions, where the padding overlaps for all the possible active members.
Tail padding, up to the allocator granularity / alignment size.

If the implementation-specific builtins don't match on these, then maybe they should have different names, is his argument I think.

If the implementation-specific builtins don't match on these, then maybe they should have different names, is his argument I think.

That's a fair point. And I agree, if they don't match, maybe it would be best to have different names. I'm hoping that we can all agree on how to handle these gray areas, though.

In D87974#2438723, @zoecarver wrote:

In D87974#2438682, @BillyONeal wrote:

Are they actually the same, with the same handling of corner cases like unions and tail padding?
There's more to this than just the name, and if they aren't the same, it seems better to have two names.

They are both implementing the same C++ feature, with the same desired semantics of zeroing out any bits in the object representation that aren't in the value representation. If they differ, one or the other would have a bug.

Do they support non-trivially copyable types? That isn't required for the atomic compare exchange feature, but is relevant for a feature exposed to users. What about extensions like zero-sized arrays or C99 flexible array members?

I agree, they either need to be identical (including corner cases) or there needs to be two of them (i.e., GCC ships both __builtin_zero_non_value_bits and __builtin_clear_padding and the first is the same as MSVC, Clang, and NVCC).

GCC doesn't need to support both. It only works with libstdc++ so it only needs to support the one used by libstdc++ (although there is a patch to add -stdlib=libc++ to GCC).

If libstdc++ uses __has_builtin to check what the compiler supports then Clang doesn't even need to support GCC's built-in, because libstdc++ wouldn't use it if not supported (and could use __builtin_zero_non_value_bits instead when supported).

The Intel compiler would need to support both though.

Is there a specification for __builtin_zero_non_value_bits available somewhere?

I don't know if there is a formal spec for it beyond the actual C++ standard.

I think P0528 is the relevant paper but other than that, no, there's not a spec. I think that's going to be the most time sensitive part of implementing this: coming up with the spec and making sure all the tests pass on all the implementations.

GCC has publicly available documentation describing its built-in, and publicly available tests for it. That's the kind of spec I'm looking for.

In D87974#2439043, @jwakely wrote:

In D87974#2438723, @zoecarver wrote:

In D87974#2438682, @BillyONeal wrote:

Are they actually the same, with the same handling of corner cases like unions and tail padding?
There's more to this than just the name, and if they aren't the same, it seems better to have two names.

They are both implementing the same C++ feature, with the same desired semantics of zeroing out any bits in the object representation that aren't in the value representation. If they differ, one or the other would have a bug.

Do they support non-trivially copyable types? That isn't required for the atomic compare exchange feature, but is relevant for a feature exposed to users. What about extensions like zero-sized arrays or C99 flexible array members?

As far as MSVC is concerned this isn't "exposed to users".

What about extensions like zero-sized arrays or C99 flexible array members?

We don't have those extensions at all so it's irrelevant to talk about what the builtin would do with them in our case. (And at such time we would add such extensions presumably we would match gcc's behavior since again, there's no reason for the behavior to differ here)

I agree, they either need to be identical (including corner cases) or there needs to be two of them (i.e., GCC ships both __builtin_zero_non_value_bits and __builtin_clear_padding and the first is the same as MSVC, Clang, and NVCC).

GCC doesn't need to support both. It only works with libstdc++ so it only needs to support the one used by libstdc++ (although there is a patch to add -stdlib=libc++ to GCC).

If libstdc++ uses __has_builtin to check what the compiler supports then Clang doesn't even need to support GCC's built-in, because libstdc++ wouldn't use it if not supported (and could use __builtin_zero_non_value_bits instead when supported).

The Intel compiler would need to support both though.

Is there a specification for __builtin_zero_non_value_bits available somewhere?

I don't know if there is a formal spec for it beyond the actual C++ standard.

I think P0528 is the relevant paper but other than that, no, there's not a spec. I think that's going to be the most time sensitive part of implementing this: coming up with the spec and making sure all the tests pass on all the implementations.

GCC has publicly available documentation describing its built-in, and publicly available tests for it. That's the kind of spec I'm looking for.

We don't consider it "publicly available" so there isn't going to be that kind of documentation for it. I don't see a serious problem with the gcc version of that builtin supporting a superset of the functionality of the equivalent msvc builtin.

Of course if it's already publicly documented for you the horse has presumably already left the barn which makes the discussion moot?

In D87974#2440533, @BillyONeal wrote:

Of course if it's already publicly documented for you the horse has presumably already left the barn which makes the discussion moot?

It's not in a shipping release yet. But the point of documenting such built-ins partly so that other compiler implementors (and vendors of tools such as static analyzers) know what they're trying to be consistent with.

mnatsuhara added a subscriber: mnatsuhara.Dec 9 2020, 7:06 AM

After following up with the team responsible for __builtin_zero_non_value_bits on MSVC, I have some more information to offer in conjunction with @BillyONeal's report:

For unions, we always assume that it has unique object representations and thus does not have any padding bytes. This allows __builtin_zero_non_value_bits to be used with types with union members, not always accurately but never destructively. (If the union has padding bytes, we don't know which member is active so we don't know exactly where the padding bytes are. To return padding bytes assuming any member risks changing the value.)
Our implementation does appear to include tail padding and this would include any such padding that is due to alignment requirements, etc.

scott.linder mentioned this in D98477: [ADT] Add IntrusiveVariant, VariantTraits, and new STLForwardCompat.Jun 3 2021, 12:22 PM

barcharcraz added a subscriber: barcharcraz.Jun 28 2022, 12:20 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 28 2022, 12:20 PM

Revision Contents

Path

Size

clang/

include/

clang/

Basic/

Builtins.def

1 line

lib/

CodeGen/

CGBuiltin.cpp

96 lines

Sema/

SemaChecking.cpp

20 lines

test/

CodeGenCXX/

builtin-zero-non-value-bits-codegen.cpp

112 lines

builtin-zero-non-value-bits.cpp

249 lines

SemaCXX/

builtin-zero-non-value-bits.cpp

15 lines

Diff 295445

clang/include/clang/Basic/Builtins.def

	Show First 20 Lines • Show All 528 Lines • ▼ Show 20 Lines
	BUILTIN(__builtin_unwind_init, "v", "")			BUILTIN(__builtin_unwind_init, "v", "")
	BUILTIN(__builtin_eh_return_data_regno, "iIi", "nc")			BUILTIN(__builtin_eh_return_data_regno, "iIi", "nc")
	BUILTIN(__builtin_snprintf, "iczcC.", "nFp:2:")			BUILTIN(__builtin_snprintf, "iczcC.", "nFp:2:")
	BUILTIN(__builtin_vsprintf, "iccCa", "nFP:1:")			BUILTIN(__builtin_vsprintf, "iccCa", "nFP:1:")
	BUILTIN(__builtin_vsnprintf, "iczcCa", "nFP:2:")			BUILTIN(__builtin_vsnprintf, "iczcCa", "nFP:2:")
	BUILTIN(__builtin_thread_pointer, "v*", "nc")			BUILTIN(__builtin_thread_pointer, "v*", "nc")
	BUILTIN(__builtin_launder, "vv", "nt")			BUILTIN(__builtin_launder, "vv", "nt")
	LANGBUILTIN(__builtin_is_constant_evaluated, "b", "n", CXX_LANG)			LANGBUILTIN(__builtin_is_constant_evaluated, "b", "n", CXX_LANG)
				LANGBUILTIN(__builtin_zero_non_value_bits, "v.", "n", CXX_LANG)

	// GCC exception builtins			// GCC exception builtins
	BUILTIN(__builtin_eh_return, "vzv*", "r") // FIXME: Takes intptr_t, not size_t!			BUILTIN(__builtin_eh_return, "vzv*", "r") // FIXME: Takes intptr_t, not size_t!
	BUILTIN(__builtin_frob_return_addr, "vv", "n")			BUILTIN(__builtin_frob_return_addr, "vv", "n")
	BUILTIN(__builtin_dwarf_cfa, "v*", "n")			BUILTIN(__builtin_dwarf_cfa, "v*", "n")
	BUILTIN(__builtin_init_dwarf_reg_size_table, "vv*", "n")			BUILTIN(__builtin_init_dwarf_reg_size_table, "vv*", "n")
	BUILTIN(__builtin_dwarf_sp_column, "Ui", "n")			BUILTIN(__builtin_dwarf_sp_column, "Ui", "n")
	BUILTIN(__builtin_extend_pointer, "ULLiv*", "n") // _Unwind_Word == uint64_t			BUILTIN(__builtin_extend_pointer, "ULLiv*", "n") // _Unwind_Word == uint64_t
	▲ Show 20 Lines • Show All 1,049 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGBuiltin.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,636 Lines • ▼ Show 20 Lines	RValue CodeGenFunction::emitRotate(const CallExpr *E, bool IsRotateRight) {
ShiftAmt = Builder.CreateIntCast(ShiftAmt, Ty, false);		ShiftAmt = Builder.CreateIntCast(ShiftAmt, Ty, false);

// Rotate is a special case of LLVM funnel shift - 1st 2 args are the same.		// Rotate is a special case of LLVM funnel shift - 1st 2 args are the same.
unsigned IID = IsRotateRight ? Intrinsic::fshr : Intrinsic::fshl;		unsigned IID = IsRotateRight ? Intrinsic::fshr : Intrinsic::fshl;
Function *F = CGM.getIntrinsic(IID, Ty);		Function *F = CGM.getIntrinsic(IID, Ty);
return RValue::get(Builder.CreateCall(F, { Src, Src, ShiftAmt }));		return RValue::get(Builder.CreateCall(F, { Src, Src, ShiftAmt }));
}		}

		static void RecursivelyZeroNonValueBits(CodeGenFunction &CGF, Value *Ptr,
		QualType Ty) {
		auto *I8Ptr = CGF.Builder.CreateBitCast(Ptr, CGF.Int8PtrTy);
		jfbUnsubmitted Not Done Reply Inline Actions I'd like to hear @rsmith's thoughts on this approach, in particular w.r.t. aliasing concerns. I also wonder if below the GEPs should be inbounds, depending on how they're created. jfb: I'd like to hear @rsmith's thoughts on this approach, in particular w.r.t. aliasing concerns. I…
		zoecarverAuthorUnsubmitted Done Reply Inline Actions Hmm, I don't know much about it but, I think these could be inbound. Because we will never actually go beyond the size of the llvm type. zoecarver: Hmm, I don't know much about it but, I think these could be inbound. Because we will never…
		auto *Zero = ConstantInt::get(CGF.Int8Ty, 0);
		auto WriteZeroAtOffset = [&](size_t Offset) {
		auto Index = ConstantInt::get(CGF.IntTy, Offset);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto Index' can be declared as 'auto Index' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto Index' can be declared as 'auto *Index' [llvm-qualified-auto]…
		auto Element = CGF.Builder.CreateGEP(I8Ptr, Index);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto Element' can be declared as 'auto Element' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto Element' can be declared as 'auto *Element' [llvm-qualified-auto]…
		CGF.Builder.CreateAlignedStore(
		jfbUnsubmitted Done Reply Inline Actions You should use `alignmentAtOffset` here. jfb: You should use `alignmentAtOffset` here.
		zoecarverAuthorUnsubmitted Done Reply Inline Actions I'm using `CharUnits::One().alignmentAtOffset` to here because this type will always have a size of 1 because it's an `i8` ptr. zoecarver: I'm using `CharUnits::One().alignmentAtOffset` to here because this type will always have a…
		Zero, Element,
		CharUnits::One().alignmentAtOffset(CharUnits::fromQuantity(Offset)));
		};
		auto GetStructLayout = [&CGF](llvm::Type *Ty) {
		auto ST = cast<StructType>(Ty);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto ST' can be declared as 'auto ST' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto ST' can be declared as 'auto *ST' [llvm-qualified-auto] [[https…
		return CGF.CGM.getModule().getDataLayout().getStructLayout(ST);
		};

		auto ST = cast<StructType>(Ptr->getType()->getPointerElementType());
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto ST' can be declared as 'auto ST' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto ST' can be declared as 'auto *ST' [llvm-qualified-auto] [[https…
		auto SL = GetStructLayout(ST);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto SL' can be declared as 'const auto SL' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto SL' can be declared as 'const auto *SL' [llvm-qualified-auto]…
		auto R = cast<CXXRecordDecl>(Ty->getAsRecordDecl());
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto R' can be declared as 'auto R' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto R' can be declared as 'auto *R' [llvm-qualified-auto] [[https…
		const ASTRecordLayout &ASTLayout = CGF.getContext().getASTRecordLayout(R);
		size_t RunningOffset = 0;
		for (auto Base : R->bases()) {
		// Zero padding between base elements.
		auto BaseRecord = cast<CXXRecordDecl>(Base.getType()->getAsRecordDecl());
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto BaseRecord' can be declared as 'auto BaseRecord' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto BaseRecord' can be declared as 'auto *BaseRecord' [llvm-qualified…
		auto Offset = static_cast<size_t>(
		ASTLayout.getBaseClassOffset(BaseRecord).getQuantity());
		for (; RunningOffset < Offset; ++RunningOffset) {
		WriteZeroAtOffset(RunningOffset);
		}
		// Recursively zero out base classes.
		auto Index = SL->getElementContainingOffset(Offset);
		auto BaseElement = CGF.Builder.CreateStructGEP(Ptr, Index);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto BaseElement' can be declared as 'auto BaseElement' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto BaseElement' can be declared as 'auto *BaseElement' [llvm-qualified…
		RecursivelyZeroNonValueBits(CGF, BaseElement, Base.getType());
		// Use the LLVM StructType data layout so we pick up on packed types.
		auto SL = GetStructLayout(ST->getElementType(Index));
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto SL' can be declared as 'const auto SL' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto SL' can be declared as 'const auto *SL' [llvm-qualified-auto]…
		auto Size = SL->getSizeInBytes();
		RunningOffset = Offset + Size;
		}
		jfbUnsubmitted Done Reply Inline Actions Typo in "fields". jfb: Typo in "fields".

		size_t NumFields = std::distance(R->field_begin(), R->field_end());
		auto CurrentField = R->field_begin();
		for (size_t I = 0; I < NumFields; ++I, ++CurrentField) {
		// Size needs to be in bytes so we can compare it later.
		auto Offset = ASTLayout.getFieldOffset(I) / 8;
		for (; RunningOffset < Offset; ++RunningOffset) {
		WriteZeroAtOffset(RunningOffset);
		}

		auto Index = SL->getElementContainingOffset(Offset);
		// If this field is an object, it may have non-zero padding.
		if (CurrentField->getType()->isRecordType()) {
		auto Element = CGF.Builder.CreateStructGEP(Ptr, Index);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto Element' can be declared as 'auto Element' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto Element' can be declared as 'auto *Element' [llvm-qualified-auto]…
		RecursivelyZeroNonValueBits(CGF, Element, CurrentField->getType());
		}

		// TODO: warn if non-constant array type.
		if (isa<ConstantArrayType>(CurrentField->getType()) &&
		CurrentField->getType()
		->getArrayElementTypeNoTypeQual()
		->isRecordType()) {
		zoecarverAuthorUnsubmitted Done Reply Inline Actions Is it OK to possibly create hundreds of stores here? I assume later optimizations will catch this and turn it into a loop or a call to memset or something. But this could potentially be harmful to code size. zoecarver: Is it OK to possibly create hundreds of stores here? I assume later optimizations will catch…
		auto FieldElement = CGF.Builder.CreateStructGEP(Ptr, Index);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto FieldElement' can be declared as 'auto FieldElement' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto FieldElement' can be declared as 'auto *FieldElement' [llvm…
		auto AT = cast<ConstantArrayType>(CurrentField->getType());
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto AT' can be declared as 'const auto AT' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto AT' can be declared as 'const auto *AT' [llvm-qualified-auto]…
		for (size_t ArrIndex = 0; ArrIndex < AT->getSize().getLimitedValue();
		++ArrIndex) {
		auto ElementRecord = AT->getElementType()->getAsRecordDecl();
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto ElementRecord' can be declared as 'auto ElementRecord' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto ElementRecord' can be declared as 'auto *ElementRecord' [llvm…
		auto ElementAlign =
		CGF.getContext().getASTRecordLayout(ElementRecord).getAlignment();
		Address FieldElementAddr{FieldElement, ElementAlign};
		auto Element =
		CGF.Builder.CreateConstArrayGEP(FieldElementAddr, ArrIndex);
		RecursivelyZeroNonValueBits(CGF, Element.getPointer(),
		AT->getElementType());
		}
		}

		auto Size = CGF.CGM.getModule()
		.getDataLayout()
		.getTypeSizeInBits(ST->getElementType(Index))
		.getKnownMinSize() /
		8;
		RunningOffset = Offset + Size;
		}
		// Clear all bits after the last field.
		auto Size = SL->getSizeInBytes();
		for (; RunningOffset < Size; ++RunningOffset) {
		WriteZeroAtOffset(RunningOffset);
		}
		}

RValue CodeGenFunction::EmitBuiltinExpr(const GlobalDecl GD, unsigned BuiltinID,		RValue CodeGenFunction::EmitBuiltinExpr(const GlobalDecl GD, unsigned BuiltinID,
const CallExpr *E,		const CallExpr *E,
ReturnValueSlot ReturnValue) {		ReturnValueSlot ReturnValue) {
const FunctionDecl *FD = GD.getDecl()->getAsFunction();		const FunctionDecl *FD = GD.getDecl()->getAsFunction();
// See if we can constant fold this builtin. If so, don't emit it at all.		// See if we can constant fold this builtin. If so, don't emit it at all.
Expr::EvalResult Result;		Expr::EvalResult Result;
if (E->EvaluateAsRValue(Result, CGM.getContext()) &&		if (E->EvaluateAsRValue(Result, CGM.getContext()) &&
!Result.hasSideEffects()) {		!Result.hasSideEffects()) {
▲ Show 20 Lines • Show All 1,288 Lines • ▼ Show 20 Lines	case Builtin::BI__builtin_launder: {
const Expr *Arg = E->getArg(0);		const Expr *Arg = E->getArg(0);
QualType ArgTy = Arg->getType()->getPointeeType();		QualType ArgTy = Arg->getType()->getPointeeType();
Value *Ptr = EmitScalarExpr(Arg);		Value *Ptr = EmitScalarExpr(Arg);
if (TypeRequiresBuiltinLaunder(CGM, ArgTy))		if (TypeRequiresBuiltinLaunder(CGM, ArgTy))
Ptr = Builder.CreateLaunderInvariantGroup(Ptr);		Ptr = Builder.CreateLaunderInvariantGroup(Ptr);

return RValue::get(Ptr);		return RValue::get(Ptr);
}		}
		case Builtin::BI__builtin_zero_non_value_bits: {
		const Expr *Op = E->getArg(0);
		Value *Address = EmitScalarExpr(Op);
		auto PointeeTy = Op->getType()->getPointeeType();
		RecursivelyZeroNonValueBits(*this, Address, PointeeTy);
		return RValue::get(nullptr);
		}
case Builtin::BI__sync_fetch_and_add:		case Builtin::BI__sync_fetch_and_add:
case Builtin::BI__sync_fetch_and_sub:		case Builtin::BI__sync_fetch_and_sub:
case Builtin::BI__sync_fetch_and_or:		case Builtin::BI__sync_fetch_and_or:
case Builtin::BI__sync_fetch_and_and:		case Builtin::BI__sync_fetch_and_and:
case Builtin::BI__sync_fetch_and_xor:		case Builtin::BI__sync_fetch_and_xor:
case Builtin::BI__sync_fetch_and_nand:		case Builtin::BI__sync_fetch_and_nand:
case Builtin::BI__sync_add_and_fetch:		case Builtin::BI__sync_add_and_fetch:
case Builtin::BI__sync_sub_and_fetch:		case Builtin::BI__sync_sub_and_fetch:
▲ Show 20 Lines • Show All 13,796 Lines • Show Last 20 Lines

clang/lib/Sema/SemaChecking.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,587 Lines • ▼ Show 20 Lines	case Builtin::BI__builtin_constant_p: {
ExprResult Arg = DefaultFunctionArrayLvalueConversion(TheCall->getArg(0));		ExprResult Arg = DefaultFunctionArrayLvalueConversion(TheCall->getArg(0));
if (Arg.isInvalid()) return true;		if (Arg.isInvalid()) return true;
TheCall->setArg(0, Arg.get());		TheCall->setArg(0, Arg.get());
TheCall->setType(Context.IntTy);		TheCall->setType(Context.IntTy);
break;		break;
}		}
case Builtin::BI__builtin_launder:		case Builtin::BI__builtin_launder:
return SemaBuiltinLaunder(*this, TheCall);		return SemaBuiltinLaunder(*this, TheCall);
		case Builtin::BI__builtin_zero_non_value_bits: {
		const Expr *PtrArg = TheCall->getArg(0)->IgnoreParenImpCasts();
		const QualType PtrArgType = PtrArg->getType();
		if (!PtrArgType->isPointerType() \|\|
		!PtrArgType->getPointeeType()->isRecordType()) {
		Diag(PtrArg->getBeginLoc(), diag::err_typecheck_convert_incompatible)
		<< PtrArgType << "structure pointer" << 1 << 0 << 3 << 1 << PtrArgType
		<< "structure pointer";
		return ExprError();
		}
		if (PtrArgType->getPointeeType().isConstQualified()) {
		Diag(PtrArg->getBeginLoc(), diag::err_typecheck_assign_const)
		<< TheCall->getSourceRange() << 5 /ConstUnknown/;
		return ExprError();
		}
		if (RequireCompleteType(PtrArg->getBeginLoc(), PtrArgType->getPointeeType(),
		diag::err_typecheck_decl_incomplete_type))
		return ExprError();
		break;
		}
case Builtin::BI__sync_fetch_and_add:		case Builtin::BI__sync_fetch_and_add:
case Builtin::BI__sync_fetch_and_add_1:		case Builtin::BI__sync_fetch_and_add_1:
case Builtin::BI__sync_fetch_and_add_2:		case Builtin::BI__sync_fetch_and_add_2:
case Builtin::BI__sync_fetch_and_add_4:		case Builtin::BI__sync_fetch_and_add_4:
case Builtin::BI__sync_fetch_and_add_8:		case Builtin::BI__sync_fetch_and_add_8:
case Builtin::BI__sync_fetch_and_add_16:		case Builtin::BI__sync_fetch_and_add_16:
case Builtin::BI__sync_fetch_and_sub:		case Builtin::BI__sync_fetch_and_sub:
case Builtin::BI__sync_fetch_and_sub_1:		case Builtin::BI__sync_fetch_and_sub_1:
▲ Show 20 Lines • Show All 14,233 Lines • Show Last 20 Lines

clang/test/CodeGenCXX/builtin-zero-non-value-bits-codegen.cpp

This file was added.

				// RUN: %clang_cc1 -triple=x86_64-linux-gnu -emit-llvm -o - %s \| FileCheck %s

				struct alignas(4) Foo {
				char a;
				alignas(2) char b;
				};

				struct alignas(4) Bar {
				char c;
				alignas(2) char d;
				};

				struct alignas(4) Baz : Foo {
				char e;
				Bar f;
				};

				jfbUnsubmitted Done Reply Inline Actions It would be helpful to have a comment with the final layout of the struct, including padding. Give each padding field a name, and reference them in the IR check below. jfb: It would be helpful to have a comment with the final layout of the struct, including padding.
				zoecarverAuthorUnsubmitted Done Reply Inline Actions Done. I've named each padding field as "PAD_X" so below it should be clear what fields are being stored without a comment. zoecarver: Done. I've named each padding field as "PAD_X" so below it should be clear what fields are…
				// Baz structure:
				// "a", PAD_1, "b", PAD_2, "c", PAD_3, PAD_4, PAD_5, "c", PAD_6, "d", PAD_7
				// %struct.Baz = type { %struct.Foo, i8, [3 x i8], %struct.Bar }
				// %struct.Foo = type { i8, i8, i8, i8 }
				// %struct.Bar = type { i8, i8, i8, i8 }

				// CHECK-LABEL: define void @_Z7testBazP3Baz(%struct.Baz* %baz)
				// CHECK: [[ADDR:%.]] = alloca %struct.Baz
				// CHECK: store %struct.Baz* %baz, %struct.Baz** [[ADDR]]
				// CHECK: [[BAZ:%.]] = load %struct.Baz, %struct.Baz** [[ADDR]]
				jfbUnsubmitted Done Reply Inline Actions It would help read the tests if you had a comment on top of each store, for example here "padding byte X". jfb: It would help read the tests if you had a comment on top of each store, for example here…
				zoecarverAuthorUnsubmitted Done Reply Inline Actions See above comment. zoecarver: See above comment.
				// CHECK: [[BAZ_RAW_PTR:%.]] = bitcast %struct.Baz [[BAZ]] to i8*

				// CHECK: [[FOO_BASE:%.]] = getelementptr inbounds %struct.Baz, %struct.Baz [[BAZ]], i32 0, i32 0
				// CHECK: [[FOO_RAW_PTR:%.]] = bitcast %struct.Foo [[FOO_BASE]] to i8*
				// CHECK: [[PAD_1:%.]] = getelementptr i8, i8 [[FOO_RAW_PTR]], i32 1
				// CHECK: store i8 0, i8* [[PAD_1]]
				// CHECK: [[PAD_2:%.]] = getelementptr i8, i8 [[FOO_RAW_PTR]], i32 3
				// CHECK: store i8 0, i8* [[PAD_2]]

				// CHECK: [[PAD_3:%.]] = getelementptr i8, i8 [[BAZ_RAW_PTR]], i32 5
				// CHECK: store i8 0, i8* [[PAD_3]]
				// CHECK: [[PAD_4:%.]] = getelementptr i8, i8 [[BAZ_RAW_PTR]], i32 6
				// CHECK: store i8 0, i8* [[PAD_4]]
				// CHECK: [[PAD_5:%.]] = getelementptr i8, i8 [[BAZ_RAW_PTR]], i32 7
				// CHECK: store i8 0, i8* [[PAD_5]]

				// CHECK: [[BAR_MEMBER:%.]] = getelementptr inbounds %struct.Baz, %struct.Baz [[BAZ]], i32 0, i32 3
				// CHECK: [[BAR_RAW_PTR:%.]] = bitcast %struct.Bar [[BAR_MEMBER]] to i8*
				// CHECK: [[PAD_6:%.]] = getelementptr i8, i8 [[BAR_RAW_PTR]], i32 1
				// CHECK: store i8 0, i8* [[PAD_6]]
				jfbUnsubmitted Done Reply Inline Actions It would be useful to see a test for arrays with a type that contains tail padding. jfb: It would be useful to see a test for arrays with a type that contains tail padding.
				zoecarverAuthorUnsubmitted Done Reply Inline Actions Hmm, this test case doesn't seem to be working. I'll investigate further. zoecarver: Hmm, this test case doesn't seem to be working. I'll investigate further.
				zoecarverAuthorUnsubmitted Done Reply Inline Actions OK, I've added that. Just to clarify, you mean a type that contains a constant array type of types with tail padding (i.e., `Bar [2]`)? zoecarver: OK, I've added that. Just to clarify, you mean a type that contains a constant array type of…
				zoecarverAuthorUnsubmitted Done Reply Inline Actions Let me know if there are more codegen tests you'd like me to add. Happy to add them. (Maybe one for vector types, or volatile, or one that doesn't have an non-value bits?) zoecarver: Let me know if there are more codegen tests you'd like me to add. Happy to add them. (Maybe one…
				// CHECK: [[PAD_7:%.]] = getelementptr i8, i8 [[BAR_RAW_PTR]], i32 3
				// CHECK: store i8 0, i8* [[PAD_7]]
				// CHECK: ret void
				void testBaz(Baz *baz) {
				__builtin_zero_non_value_bits(baz);
				}

				struct UnsizedTail {
				int size;
				alignas(8) char buf[];

				UnsizedTail(int size) : size(size) {}
				};

				// UnsizedTail structure:
				// "size", PAD_1, PAD_2, PAD_3, PAD_4
				// %struct.UnsizedTail = type { i32, [4 x i8], [0 x i8] }

				// CHECK-LABEL: define void @_Z15testUnsizedTailP11UnsizedTail(%struct.UnsizedTail* %u)
				// CHECK: [[U_ADDR:%.]] = alloca %struct.UnsizedTail
				// CHECK: store %struct.UnsizedTail* %u, %struct.UnsizedTail** [[U_ADDR]]
				// CHECK: [[U:%.]] = load %struct.UnsizedTail, %struct.UnsizedTail** [[U_ADDR]]
				// CHECK: [[U_RAW_PTR:%.]] = bitcast %struct.UnsizedTail [[U]] to i8*
				// CHECK: [[PAD_1:%.]] = getelementptr i8, i8 [[U_RAW_PTR]], i32 4
				// CHECK: store i8 0, i8* [[PAD_1]]
				// CHECK: [[PAD_2:%.]] = getelementptr i8, i8 [[U_RAW_PTR]], i32 5
				// CHECK: store i8 0, i8* [[PAD_2]]
				// CHECK: [[PAD_3:%.]] = getelementptr i8, i8 [[U_RAW_PTR]], i32 6
				// CHECK: store i8 0, i8* [[PAD_3]]
				// CHECK: [[PAD_4:%.]] = getelementptr i8, i8 [[U_RAW_PTR]], i32 7
				// CHECK: store i8 0, i8* [[PAD_4]]
				// CHECK: ret void
				void testUnsizedTail(UnsizedTail *u) {
				__builtin_zero_non_value_bits(u);
				}

				struct ArrOfStructsWithPadding {
				Bar bars[2];
				};

				// ArrOfStructsWithPadding structure:
				// "c" (1), PAD_1, "d" (1), PAD_2, "c" (2), PAD_3, "d" (2), PAD_4
				// %struct.ArrOfStructsWithPadding = type { [2 x %struct.Bar] }

				// CHECK-LABEL: define void @_Z27testArrOfStructsWithPaddingP23ArrOfStructsWithPadding(%struct.ArrOfStructsWithPadding* %arr)
				// CHECK: [[ARR_ADDR:%.]] = alloca %struct.ArrOfStructsWithPadding
				// CHECK: store %struct.ArrOfStructsWithPadding* %arr, %struct.ArrOfStructsWithPadding** [[ARR_ADDR]]
				// CHECK: [[ARR:%.]] = load %struct.ArrOfStructsWithPadding, %struct.ArrOfStructsWithPadding** [[ARR_ADDR]]
				// CHECK: [[BARS:%.]] = getelementptr inbounds %struct.ArrOfStructsWithPadding, %struct.ArrOfStructsWithPadding [[ARR]], i32 0, i32 0
				// CHECK: [[FIRST:%.]] = getelementptr inbounds [2 x %struct.Bar], [2 x %struct.Bar] [[BARS]], i64 0, i64 0
				// CHECK: [[FIRST_RAW_PTR:%.]] = bitcast %struct.Bar [[FIRST]] to i8*
				// CHECK: [[PAD_1:%.]] = getelementptr i8, i8 [[FIRST_RAW_PTR]], i32 1
				// CHECK: store i8 0, i8* [[PAD_1]]
				// CHECK: [[PAD_2:%.]] = getelementptr i8, i8 %4, i32 3
				// CHECK: store i8 0, i8* [[PAD_2]]
				// CHECK: [[SECOND:%.]] = getelementptr inbounds [2 x %struct.Bar], [2 x %struct.Bar] [[BARS]], i64 0, i64 1
				// CHECK: [[SECOND_RAW_PTR:%.]] = bitcast %struct.Bar [[SECOND]] to i8*
				// CHECK: [[PAD_3:%.]] = getelementptr i8, i8 [[SECOND_RAW_PTR]], i32 1
				// CHECK: store i8 0, i8* [[PAD_3]]
				// CHECK: [[PAD_4:%.]] = getelementptr i8, i8 [[SECOND_RAW_PTR]], i32 3
				// CHECK: store i8 0, i8* [[PAD_4]]
				// CHECK: ret void
				void testArrOfStructsWithPadding(ArrOfStructsWithPadding *arr) {
				__builtin_zero_non_value_bits(arr);
				}

clang/test/CodeGenCXX/builtin-zero-non-value-bits.cpp

This file was added.

				// RUN: mkdir -p %t
				// RUN: %clang++ %s -o %t/run
				// RUN: %t/run

				#include <cassert>
				#include <cstdio>
				#include <cstring>
				#include <new>

				template <size_t A1, size_t A2, class T>
				struct alignas(A1) BasicWithPadding {
				T x;
				alignas(A2) T y;
				};

				template <size_t A1, size_t A2, size_t N, class T>
				struct alignas(A1) SpacedArrayMembers {
				T x[N];
				alignas(A2) char c;
				T y[N];
				};

				template <size_t A1, size_t A2, class T>
				struct alignas(A1) PaddedPointerMembers {
				T *x;
				alignas(A2) T *y;
				};

				template <size_t A1, size_t A2, size_t A3, class T>
				struct alignas(A1) ThreeMembers {
				T x;
				alignas(A2) T y;
				alignas(A3) T z;
				};

				template <class T>
				struct Normal {
				T a;
				T b;
				};

				template <class T>
				struct X {
				T x;
				};

				template <class T>
				struct Z {
				T z;
				};

				template <size_t A, class T>
				struct YZ : public Z<T> {
				alignas(A) T y;
				};

				template <size_t A1, size_t A2, class T>
				struct alignas(A1) HasBase : public X<T>, public YZ<A2, T> {
				T a;
				alignas(A2) T b;
				};

				template <size_t A1, size_t A2, class T>
				void testAllForType(T a, T b, T c, T d) {
				using B = BasicWithPadding<A1, A2, T>;
				B basic1;
				memset(&basic1, 0, sizeof(B));
				basic1.x = a;
				basic1.y = b;
				B basic2;
				memset(&basic2, 42, sizeof(B));
				basic2.x = a;
				basic2.y = b;
				assert(memcmp(&basic1, &basic2, sizeof(B)) != 0);
				__builtin_zero_non_value_bits(&basic2);
				assert(memcmp(&basic1, &basic2, sizeof(B)) == 0);

				using A = SpacedArrayMembers<A1, A2, 2, T>;
				A arr1;
				memset(&arr1, 0, sizeof(A));
				arr1.x[0] = a;
				arr1.x[1] = b;
				arr1.y[0] = c;
				arr1.y[1] = d;
				A arr2;
				memset(&arr2, 42, sizeof(A));
				arr2.x[0] = a;
				arr2.x[1] = b;
				arr2.y[0] = c;
				arr2.y[1] = d;
				arr2.c = 0;
				assert(memcmp(&arr1, &arr2, sizeof(A)) != 0);
				__builtin_zero_non_value_bits(&arr2);
				assert(memcmp(&arr1, &arr2, sizeof(A)) == 0);

				using P = PaddedPointerMembers<A1, A2, T>;
				P ptr1;
				memset(&ptr1, 0, sizeof(P));
				ptr1.x = &a;
				ptr1.y = &b;
				P ptr2;
				memset(&ptr2, 42, sizeof(P));
				ptr2.x = &a;
				ptr2.y = &b;
				assert(memcmp(&ptr1, &ptr2, sizeof(P)) != 0);
				__builtin_zero_non_value_bits(&ptr2);
				assert(memcmp(&ptr1, &ptr2, sizeof(P)) == 0);

				using Three = ThreeMembers<A1, A2, A2, T>;
				Three three1;
				memset(&three1, 0, sizeof(Three));
				three1.x = a;
				three1.y = b;
				three1.z = c;
				Three three2;
				memset(&three2, 42, sizeof(Three));
				three2.x = a;
				three2.y = b;
				three2.z = c;
				__builtin_zero_non_value_bits(&three2);
				assert(memcmp(&three1, &three2, sizeof(Three)) == 0);

				using N = Normal<T>;
				N normal1;
				memset(&normal1, 0, sizeof(N));
				normal1.a = a;
				normal1.b = b;
				N normal2;
				memset(&normal2, 42, sizeof(N));
				normal2.a = a;
				normal2.b = b;
				__builtin_zero_non_value_bits(&normal2);
				assert(memcmp(&normal1, &normal2, sizeof(N)) == 0);

				using H = HasBase<A1, A2, T>;
				H base1;
				memset(&base1, 0, sizeof(H));
				base1.a = a;
				base1.b = b;
				base1.x = c;
				base1.y = d;
				base1.z = a;
				H base2;
				memset(&base2, 42, sizeof(H));
				base2.a = a;
				base2.b = b;
				base2.x = c;
				base2.y = d;
				base2.z = a;
				assert(memcmp(&base1, &base2, sizeof(H)) != 0);
				__builtin_zero_non_value_bits(&base2);
				unsigned i = 0;
				assert(memcmp(&base1, &base2, sizeof(H)) == 0);
				}

				struct UnsizedTail {
				int size;
				alignas(8) char buf[];

				UnsizedTail(int size) : size(size) {}
				};
				jfbUnsubmitted Not Done Reply Inline Actions Usually CodeGen tests will use lit to check the emitted IR matches expectations. I think that's what you want to do here. Remember to test `volatile` qualified pointers, as well as address spaces too. jfb: Usually CodeGen tests will use lit to check the emitted IR matches expectations. I think that's…
				zoecarverAuthorUnsubmitted Not Done Reply Inline Actions What's a good place for me to put this end-to-end test? zoecarver: What's a good place for me to put this end-to-end test?
				jfbUnsubmitted Not Done Reply Inline Actions I'm not sure, I don't usually add this type of test :) jfb: I'm not sure, I don't usually add this type of test :)
				zoecarverAuthorUnsubmitted Done Reply Inline Actions Even if it's a bit unconventional, I think it would be good to have this type of test. I think we should try to cover as many test cases as possible because it's important that this builtin both doesn't zero non-padding bits and does zero all padding bits. And it wouldn't be practical to add the 100+ test cases covered here as codegen tests. zoecarver: Even if it's a bit unconventional, I think it would be good to have this type of test. I think…

				void otherTests() {
				const size_t size1 = sizeof(UnsizedTail) + 4;
				char buff1[size1];
				char buff2[size1];
				memset(buff1, 0, size1);
				memset(buff2, 42, size1);
				auto *u1 = new (buff1) UnsizedTail(4);
				u1->buf[0] = 1;
				u1->buf[1] = 2;
				u1->buf[2] = 3;
				u1->buf[3] = 4;
				auto *u2 = new (buff2) UnsizedTail(4);
				u2->buf[0] = 1;
				u2->buf[1] = 2;
				u2->buf[2] = 3;
				u2->buf[3] = 4;
				assert(memcmp(u1, u2, sizeof(UnsizedTail)) != 0);
				__builtin_zero_non_value_bits(u2);
				assert(memcmp(u1, u2, sizeof(UnsizedTail)) == 0);

				using B = BasicWithPadding<8, 4, char>;
				auto *basic1 = new B;
				memset(basic1, 0, sizeof(B));
				basic1->x = 1;
				basic1->y = 2;
				auto *basic2 = new B;
				memset(basic2, 42, sizeof(B));
				basic2->x = 1;
				basic2->y = 2;
				assert(memcmp(basic1, basic2, sizeof(B)) != 0);
				__builtin_zero_non_value_bits(basic2);
				assert(memcmp(basic1, basic2, sizeof(B)) == 0);
				delete basic2;
				delete basic1;

				using B = BasicWithPadding<8, 4, char>;
				B *basic3 = new B;
				memset(basic3, 0, sizeof(B));
				basic3->x = 1;
				basic3->y = 2;
				B *basic4 = new B;
				memset(basic4, 42, sizeof(B));
				basic4->x = 1;
				basic4->y = 2;
				assert(memcmp(basic3, basic4, sizeof(B)) != 0);
				__builtin_zero_non_value_bits(const_cast<volatile B *>(basic4));
				assert(memcmp(basic3, basic4, sizeof(B)) == 0);
				delete basic4;
				delete basic3;
				}

				struct Foo {
				int x;
				int y;
				};

				typedef float Float4Vec __attribute__((ext_vector_type(4)));
				typedef float Float3Vec __attribute__((ext_vector_type(3)));

				int main() {
				testAllForType<32, 16, char>(11, 22, 33, 44);
				testAllForType<64, 32, char>(4, 5, 6, 7);
				testAllForType<32, 16, volatile char>(11, 22, 33, 44);
				testAllForType<64, 32, volatile char>(4, 5, 6, 7);
				testAllForType<32, 16, int>(0, 1, 2, 3);
				testAllForType<64, 32, int>(4, 5, 6, 7);
				testAllForType<32, 16, volatile int>(0, 1, 2, 3);
				testAllForType<64, 32, volatile int>(4, 5, 6, 7);
				testAllForType<32, 16, double>(0, 1, 2, 3);
				testAllForType<64, 32, double>(4, 5, 6, 7);
				testAllForType<32, 16, _ExtInt(28)>(0, 1, 2, 3);
				testAllForType<64, 32, _ExtInt(28)>(4, 5, 6, 7);
				testAllForType<32, 16, _ExtInt(60)>(0, 1, 2, 3);
				testAllForType<64, 32, _ExtInt(60)>(4, 5, 6, 7);
				testAllForType<32, 16, _ExtInt(64)>(0, 1, 2, 3);
				testAllForType<64, 32, _ExtInt(64)>(4, 5, 6, 7);
				testAllForType<32, 16, Foo>(Foo{1, 2}, Foo{3, 4}, Foo{1, 2}, Foo{3, 4});
				testAllForType<64, 32, Foo>(Foo{1, 2}, Foo{3, 4}, Foo{1, 2}, Foo{3, 4});
				testAllForType<256, 128, Float3Vec>(0, 1, 2, 3);
				testAllForType<128, 128, Float3Vec>(4, 5, 6, 7);
				testAllForType<256, 128, Float4Vec>(0, 1, 2, 3);
				testAllForType<128, 128, Float4Vec>(4, 5, 6, 7);

				otherTests();

				return 0;
				}

clang/test/SemaCXX/builtin-zero-non-value-bits.cpp

This file was added.

				// RUN: %clang_cc1 -fsyntax-only -verify %s

				struct Foo {};

				struct Incomplete; // expected-note {{forward declaration of 'Incomplete'}}

				void test(int a, Foo b, void c, int d, Foo e, const Foo f, Incomplete *g) {
				__builtin_zero_non_value_bits(a); // expected-error {{passing 'int' to parameter of incompatible type structure pointer: type mismatch at 1st parameter ('int' vs structure pointer)}}
				__builtin_zero_non_value_bits(b); // expected-error {{passing 'Foo' to parameter of incompatible type structure pointer: type mismatch at 1st parameter ('Foo' vs structure pointer)}}
				__builtin_zero_non_value_bits(c); // expected-error {{passing 'void ' to parameter of incompatible type structure pointer: type mismatch at 1st parameter ('void ' vs structure pointer)}}
				__builtin_zero_non_value_bits(d); // expected-error {{passing 'int ' to parameter of incompatible type structure pointer: type mismatch at 1st parameter ('int ' vs structure pointer)}}
				__builtin_zero_non_value_bits(e); // This should not error.
				jfbUnsubmitted Done Reply Inline Actions You should also check incomplete types, vector, variable width integers, `const` qualified. jfb: You should also check incomplete types, vector, variable width integers, `const` qualified.
				__builtin_zero_non_value_bits(f); // expected-error {{read-only variable is not assignable}}
				__builtin_zero_non_value_bits(g); // expected-error {{variable has incomplete type 'Incomplete'}}
				}