This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/test/CodeGen/
-
test/
-
CodeGen/
-
attr-arm-sve-vector-bits-bitcast.c
-
attr-arm-sve-vector-bits-call.c
-
attr-arm-sve-vector-bits-cast.c
-
attr-arm-sve-vector-bits-globals.c
-
llvm/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
InstCombineLoadStoreAlloca.cpp
-
test/Transforms/
-
Transforms/
-
InstCombine/
-
atomic.ll
-
load.ll
-
loadstore-metadata.ll
-
non-integral-pointers.ll
-
PhaseOrdering/
-
instcombine-sroa-inttoptr.ll

Differential D88789

[InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which are only ever stored to always use a legal integer type if one is available." (PR47592)
ClosedPublic

Authored by lebedev.ri on Oct 3 2020, 1:14 PM.

Download Raw Diff

Details

Reviewers

jfb
spatel
efriedma
MaskRay
chandlerc
nlopes
nikic
hfinkel

Commits

rGe00f189d392d: [InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which…

Summary

(it was introduced in https://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html)

This canonicalization seems dubious.

Most importantly, while it does not create inttoptr casts by itself,
it may cause them to appear later, see e.g. D88788.

I think it's pretty obvious that it is an undesirable outcome,
by now we've established that seemingly no-op inttoptr/ptrtoint casts
are not no-op, and are no longer eager to look past them.
Which e.g. means that given

%a = load i32 
%b = inttoptr %a
%c = inttoptr %a

we likely won't be able to tell that %b and %c is the same thing.

We could of course try to cleanup the IR afterwards, by enhancing
the cast-of-load transform to deal with non-single-use loads,
and i even tried that already in D75505, as it was rightfully
pointed out, that is very much not compile-time free:
https://llvm-compile-time-tracker.com/compare.php?from=871d03a6751e0f82e210c80a881ef357c5633a26&to=782be5b99377b62e998e4157ddede0fa296664b5&stat=instructions

Thusly, i'd propose to simply not perform such a canonicalization.
The original motivational RFC does not state what larger problem that canonicalization
was trying to solve, so i'm not sure how this plays out in the larger picture.

Does anyone have any thoughts?

See https://bugs.llvm.org/show_bug.cgi?id=47592

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lebedev.ri created this revision.Oct 3 2020, 1:14 PM

Herald added subscribers: dexonsmith, hiraditya. · View Herald TranscriptOct 3 2020, 1:14 PM

lebedev.ri requested review of this revision.Oct 3 2020, 1:14 PM

lebedev.ri added a parent revision: D88788: [SROA] rewritePartition()/findCommonType(): if uses have conflicting type, try getTypePartition() before falling back to largest integral use type (PR47592).

Harbormaster completed remote builds in B73891: Diff 296001.Oct 3 2020, 1:29 PM

Love it, thanks!
This gets rid of a lot of type punning issues through load/store of integers. Not introducing inttoptr during optimization is a very healthy goal.

This revision is now accepted and ready to land.Oct 3 2020, 2:58 PM

as it was rightfully pointed out, that is very much not compile-time free: https://llvm-compile-time-tracker.com/compare.php?from=871d03a6751e0f82e210c80a881ef357c5633a26&to=782be5b99377b62e998e4157ddede0fa296664b5&stat=instructions

Looks free to me?

In any case, this change looks reasonable to me. GVN has no problems deduplicating load/stores from different types (https://llvm.godbolt.org/z/5nTjWE), so I'm not sure what this canonicalization was useful for.

FWIW, I still very much feel that this is the correct canonicalization, and that downstream problems *must* be fixed downstream. Avoiding this canonicalization doesn't actually fix them, it just makes us less *aware* of the problems that still fundamentally exist. =[

That said, I'm not heavily involved in LLVM, and so if everyone currently involved thinks this is a good change, I'm not going to stand in the way. It just makes no sense to me.

Rebase/fix remaining tests.

In D88789#2310441, @nlopes wrote:

Not introducing inttoptr during optimization is a very healthy goal.

Thank you for pointing that out.
Indeed, that is very precisely my goal here.

In D88789#2310593, @nikic wrote:

as it was rightfully pointed out, that is very much not compile-time free: https://llvm-compile-time-tracker.com/compare.php?from=871d03a6751e0f82e210c80a881ef357c5633a26&to=782be5b99377b62e998e4157ddede0fa296664b5&stat=instructions

Looks free to me?

We can revisit that patch afterwards.

In D88789#2310593, @nikic wrote:

In any case, this change looks reasonable to me. GVN has no problems deduplicating load/stores from different types (https://llvm.godbolt.org/z/5nTjWE), so I'm not sure what this canonicalization was useful for.

Yep.

In D88789#2310606, @chandlerc wrote:

FWIW, I still very much feel that this is the correct canonicalization, and that downstream problems *must* be fixed downstream. Avoiding this canonicalization doesn't actually fix them, it just makes us less *aware* of the problems that still fundamentally exist. =[

That said, I'm not heavily involved in LLVM, and so if everyone currently involved thinks this is a good change, I'm not going to stand in the way. It just makes no sense to me.

Thank you for commenting!

Herald added a project: Restricted Project. · View Herald TranscriptOct 4 2020, 10:28 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Harbormaster completed remote builds in B73922: Diff 296055.Oct 4 2020, 10:45 AM

lebedev.ri mentioned this in D88806: [SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self of SCEVUnknown.Oct 4 2020, 2:18 PM

In D88789#2310606, @chandlerc wrote:

FWIW, I still very much feel that this is the correct canonicalization, and that downstream problems *must* be fixed downstream. Avoiding this canonicalization doesn't actually fix them, it just makes us less *aware* of the problems that still fundamentally exist. =[

I'd agree if we excluded all pointers from canonicalization. But the semantics of inttoptr and inttoptr-equivalent memory operations are weird; in general, I'm not sure we can recover the original semantics of the code if we throw away the pointer-ness of pointer load/store operations.

To address the issue at hand, I think changing the isNonIntegralPointerType() check to just isPtrOrPtrVectorTy() would be enough. I think that might make sense?

In D88789#2310967, @efriedma wrote:

To address the issue at hand, I think changing the isNonIntegralPointerType() check to just isPtrOrPtrVectorTy() would be enough. I think that might make sense?

I briefly considered that, but the motivational example in https://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html
was really about pointer-typed things. So while we could do that, i'm not sure how useful this fold really is overall.

In D88789#2310967, @efriedma wrote:

In D88789#2310606, @chandlerc wrote:

FWIW, I still very much feel that this is the correct canonicalization, and that downstream problems *must* be fixed downstream. Avoiding this canonicalization doesn't actually fix them, it just makes us less *aware* of the problems that still fundamentally exist. =[

I'd agree if we excluded all pointers from canonicalization. But the semantics of inttoptr and inttoptr-equivalent memory operations are weird; in general, I'm not sure we can recover the original semantics of the code if we throw away the pointer-ness of pointer load/store operations.

To address the issue at hand, I think changing the isNonIntegralPointerType() check to just isPtrOrPtrVectorTy() would be enough. I think that might make sense?

Keeping loads and stores of pointers as pointers to the extent possible doesn't seem like a bad idea, but I'm worried people will feel like this gives a *semantic* guarantee that isn't really there. Fundamentally, LLVM still doesn't currently have typed memory. All of the optimizer is built upon this assumption.

Anyways, while it doesn't seem intrinsically bad to preserve pointer types as much as possible, I feel like the underlying problem should be addressed in a more fundamental way -- that this change will just shift the problem to more complex cases where the frontend happens to use a memcpy or something similar. I wonder if revisiting D75505 makes somewhat more sense, although clearly it would need some different approach to address the compile time issues.

lebedev.ri removed a parent revision: D88788: [SROA] rewritePartition()/findCommonType(): if uses have conflicting type, try getTypePartition() before falling back to largest integral use type (PR47592).Oct 5 2020, 6:22 AM

NFC

Harbormaster completed remote builds in B73991: Diff 296173.Oct 5 2020, 7:06 AM

lebedev.ri added reviewers: nikic, hfinkel.Oct 5 2020, 10:52 AM

lebedev.ri mentioned this in D88842: [InstCombine] inttoptr(load) -> load.Oct 5 2020, 11:25 AM

Keeping loads and stores of pointers as pointers to the extent possible doesn't seem like a bad idea, but I'm worried people will feel like this gives a *semantic* guarantee that isn't really there. Fundamentally, LLVM still doesn't currently have typed memory. All of the optimizer is built upon this assumption.

LLVM currently isn't internally consistent. See https://bugs.llvm.org/show_bug.cgi?id=34548 . I should probably make a LangRef patch so the "pointer aliasing" section indicates there's an issue here.

So, we can't really teach SCEV about this: D88788 (not without the https://bugs.llvm.org/show_bug.cgi?id=47592 at lease)
And we can't recover the situation post-inlining in instcombine: D88842.

It really does look like this fold is actively breaking
otherwise-good IR, in a way that is not recoverable.
And that means, this fold isn't helpful in exposing the passes
that are otherwise unaware of these patterns it produces.

I'm proceeding with this patch.

lebedev.ri added a comment.Oct 5 2020, 12:36 PM

This comment was removed by lebedev.ri.

Closed by commit rGe00f189d392d: [InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which… (authored by lebedev.ri). · Explain WhyOct 5 2020, 2:01 PM

This revision was automatically updated to reflect the committed changes.

lebedev.ri added a commit: rGe00f189d392d: [InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which….

This ended up having a rather large impact...

Compile-time: https://llvm-compile-time-tracker.com/compare.php?from=567462b48eba1c2d286ce97117994463f4535d2e&to=e00f189d392dd9bf95f6a98f05f2d341d06cd65c&stat=instructions
Code size: https://llvm-compile-time-tracker.com/compare.php?from=567462b48eba1c2d286ce97117994463f4535d2e&to=e00f189d392dd9bf95f6a98f05f2d341d06cd65c&stat=size-text
Largest code size impact is CMakeFiles/pairlocalalign.dir/constants.c.o with a code-size reduction of ~6%.

Always happy about compile-time improvements, but the large code size changes indicate that this has a pretty significant impact on optimization, and as usual it's hard to tell whether it's a good or a bad one :)

lebedev.ri mentioned this in D88860: [LangRef] Describe why the pointer aliasing rules are currently unsound..Oct 5 2020, 11:46 PM

lebedev.ri mentioned this in D88979: [InstCombine] combineLoadToOperationType(): don't fold int<->ptr cast into load.Oct 7 2020, 9:53 AM

lebedev.ri mentioned this in D88995: Support vectors in CastInst::isBitOrNoopPointerCastable.Oct 7 2020, 11:29 PM

lebedev.ri mentioned this in rG544a6aa2674e: [InstCombine] combineLoadToOperationType(): don't fold int<->ptr cast into load.Oct 11 2020, 10:25 AM

lebedev.ri mentioned this in rG1c021c64caef: [SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self….Oct 12 2020, 1:04 AM

hans mentioned this in rG17cec6a11a12: Revert 1c021c64c "[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but….Oct 12 2020, 9:40 AM

lebedev.ri mentioned this in rG1fb610429308: Reland "[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as….Oct 12 2020, 1:04 PM

jonpa mentioned this in D92985: [SystemZTTIImpl::getMinPrefetchStride] Allow some non-prefetched mem accesses..Dec 9 2020, 5:20 PM

aqjune mentioned this in D100717: [InstCombine] Transform memcpy to ptr load/stores if TBAA says so.Apr 18 2021, 1:38 AM

Revision Contents

Path

Size

clang/

test/

CodeGen/

attr-arm-sve-vector-bits-bitcast.c

18 lines

attr-arm-sve-vector-bits-call.c

78 lines

attr-arm-sve-vector-bits-cast.c

24 lines

attr-arm-sve-vector-bits-globals.c

6 lines

llvm/

lib/

Transforms/

InstCombine/

InstCombineLoadStoreAlloca.cpp

34 lines

test/

Transforms/

InstCombine/

atomic.ll

18 lines

load.ll

44 lines

loadstore-metadata.ll

19 lines

non-integral-pointers.ll

16 lines

PhaseOrdering/

instcombine-sroa-inttoptr.ll

32 lines

Diff 296293

clang/test/CodeGen/attr-arm-sve-vector-bits-bitcast.c

	Show First 20 Lines • Show All 249 Lines • ▼ Show 20 Lines
	// CHECK-128-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0			// CHECK-128-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0
	// CHECK-128-NEXT: store <2 x i8> [[TMP1]], <2 x i8>* [[ARRAYIDX]], align 2, [[TBAA6]]			// CHECK-128-NEXT: store <2 x i8> [[TMP1]], <2 x i8>* [[ARRAYIDX]], align 2, [[TBAA6]]
	// CHECK-128-NEXT: ret void			// CHECK-128-NEXT: ret void
	//			//
	// CHECK-256-LABEL: @write_bool(			// CHECK-256-LABEL: @write_bool(
	// CHECK-256-NEXT: entry:			// CHECK-256-NEXT: entry:
	// CHECK-256-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-256-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-256-NEXT: store <vscale x 16 x i1> [[X:%.]], <vscale x 16 x i1> [[X_ADDR]], align 16, [[TBAA15:!tbaa !.*]]			// CHECK-256-NEXT: store <vscale x 16 x i1> [[X:%.]], <vscale x 16 x i1> [[X_ADDR]], align 16, [[TBAA15:!tbaa !.*]]
	// CHECK-256-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[X_ADDR]] to i32*			// CHECK-256-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[X_ADDR]] to <4 x i8>*
	// CHECK-256-NEXT: [[TMP1:%.]] = load i32, i32 [[TMP0]], align 16, [[TBAA6]]			// CHECK-256-NEXT: [[TMP1:%.]] = load <4 x i8>, <4 x i8> [[TMP0]], align 16, [[TBAA6]]
	// CHECK-256-NEXT: [[Y:%.]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1			// CHECK-256-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0
	// CHECK-256-NEXT: [[TMP2:%.]] = bitcast [3 x <4 x i8>] [[Y]] to i32*			// CHECK-256-NEXT: store <4 x i8> [[TMP1]], <4 x i8>* [[ARRAYIDX]], align 2, [[TBAA6]]
	// CHECK-256-NEXT: store i32 [[TMP1]], i32* [[TMP2]], align 2, [[TBAA6]]
	// CHECK-256-NEXT: ret void			// CHECK-256-NEXT: ret void
	//			//
	// CHECK-512-LABEL: @write_bool(			// CHECK-512-LABEL: @write_bool(
	// CHECK-512-NEXT: entry:			// CHECK-512-NEXT: entry:
	// CHECK-512-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-512-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-512-NEXT: store <vscale x 16 x i1> [[X:%.]], <vscale x 16 x i1> [[X_ADDR]], align 16, [[TBAA15:!tbaa !.*]]			// CHECK-512-NEXT: store <vscale x 16 x i1> [[X:%.]], <vscale x 16 x i1> [[X_ADDR]], align 16, [[TBAA15:!tbaa !.*]]
	// CHECK-512-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[X_ADDR]] to i64*			// CHECK-512-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[X_ADDR]] to <8 x i8>*
	// CHECK-512-NEXT: [[TMP1:%.]] = load i64, i64 [[TMP0]], align 16, [[TBAA6]]			// CHECK-512-NEXT: [[TMP1:%.]] = load <8 x i8>, <8 x i8> [[TMP0]], align 16, [[TBAA6]]
	// CHECK-512-NEXT: [[Y:%.]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1			// CHECK-512-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0
	// CHECK-512-NEXT: [[TMP2:%.]] = bitcast [3 x <8 x i8>] [[Y]] to i64*			// CHECK-512-NEXT: store <8 x i8> [[TMP1]], <8 x i8>* [[ARRAYIDX]], align 2, [[TBAA6]]
	// CHECK-512-NEXT: store i64 [[TMP1]], i64* [[TMP2]], align 2, [[TBAA6]]
	// CHECK-512-NEXT: ret void			// CHECK-512-NEXT: ret void
	//			//
	void write_bool(struct struct_bool *s, svbool_t x) {			void write_bool(struct struct_bool *s, svbool_t x) {
	s->y[0] = x;			s->y[0] = x;
	}			}

clang/test/CodeGen/attr-arm-sve-vector-bits-call.c

	Show First 20 Lines • Show All 163 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[OP2:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[OP2:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[OP2_ADDR:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[OP2_ADDR:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: [[TMP0:%.]] = bitcast <8 x i8> [[OP1]] to <vscale x 16 x i1>*			// CHECK-NEXT: [[TMP0:%.]] = bitcast <8 x i8> [[OP1]] to <vscale x 16 x i1>*
	// CHECK-NEXT: store <vscale x 16 x i1> [[OP1_COERCE:%.]], <vscale x 16 x i1> [[TMP0]], align 16			// CHECK-NEXT: store <vscale x 16 x i1> [[OP1_COERCE:%.]], <vscale x 16 x i1> [[TMP0]], align 16
	// CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x i8> [[OP1]] to i64*			// CHECK-NEXT: [[OP11:%.]] = load <8 x i8>, <8 x i8> [[OP1]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[OP113:%.]] = load i64, i64 [[TMP1]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x i8> [[OP2]] to <vscale x 16 x i1>*
	// CHECK-NEXT: [[TMP2:%.]] = bitcast <8 x i8> [[OP2]] to <vscale x 16 x i1>*			// CHECK-NEXT: store <vscale x 16 x i1> [[OP2_COERCE:%.]], <vscale x 16 x i1> [[TMP1]], align 16
	// CHECK-NEXT: store <vscale x 16 x i1> [[OP2_COERCE:%.]], <vscale x 16 x i1> [[TMP2]], align 16			// CHECK-NEXT: [[OP22:%.]] = load <8 x i8>, <8 x i8> [[OP2]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP3:%.]] = bitcast <8 x i8> [[OP2]] to i64*			// CHECK-NEXT: store <8 x i8> [[OP11]], <8 x i8>* [[OP1_ADDR]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[OP224:%.]] = load i64, i64 [[TMP3]], align 16, [[TBAA6]]			// CHECK-NEXT: store <8 x i8> [[OP22]], <8 x i8>* [[OP2_ADDR]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP4:%.]] = bitcast <8 x i8> [[OP1_ADDR]] to i64*			// CHECK-NEXT: [[TMP2:%.]] = bitcast <8 x i8> [[OP1_ADDR]] to <vscale x 16 x i1>*
	// CHECK-NEXT: store i64 [[OP113]], i64* [[TMP4]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP3:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP2]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP5:%.]] = bitcast <8 x i8> [[OP2_ADDR]] to i64*			// CHECK-NEXT: [[TMP4:%.]] = bitcast <8 x i8> [[OP2_ADDR]] to <vscale x 16 x i1>*
	// CHECK-NEXT: store i64 [[OP224]], i64* [[TMP5]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP5:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP4]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP6:%.]] = bitcast <8 x i8> [[OP1_ADDR]] to <vscale x 16 x i1>*			// CHECK-NEXT: [[TMP6:%.]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.]], <vscale x 16 x i1> [[TMP3]], <vscale x 16 x i1> [[TMP5]])
	// CHECK-NEXT: [[TMP7:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP6]], align 16, [[TBAA6]]			// CHECK-NEXT: store <vscale x 16 x i1> [[TMP6]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, [[TBAA13:!tbaa !.*]]
	// CHECK-NEXT: [[TMP8:%.]] = bitcast <8 x i8> [[OP2_ADDR]] to <vscale x 16 x i1>*			// CHECK-NEXT: [[CASTFIXEDSVE:%.]] = bitcast <vscale x 16 x i1> [[SAVED_CALL_RVALUE]] to <8 x i8>*
	// CHECK-NEXT: [[TMP9:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP8]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP7:%.]] = load <8 x i8>, <8 x i8> [[CASTFIXEDSVE]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP10:%.]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.]], <vscale x 16 x i1> [[TMP7]], <vscale x 16 x i1> [[TMP9]])			// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to <8 x i8>*
	// CHECK-NEXT: store <vscale x 16 x i1> [[TMP10]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, [[TBAA13:!tbaa !.*]]			// CHECK-NEXT: store <8 x i8> [[TMP7]], <8 x i8>* [[RETVAL_0__SROA_CAST]], align 16
	// CHECK-NEXT: [[TMP11:%.]] = bitcast <vscale x 16 x i1> [[SAVED_CALL_RVALUE]] to i64*			// CHECK-NEXT: [[TMP8:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16
	// CHECK-NEXT: [[TMP12:%.]] = load i64, i64 [[TMP11]], align 16, [[TBAA6]]			// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP8]]
	// CHECK-NEXT: [[TMP13:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to i64*
	// CHECK-NEXT: store i64 [[TMP12]], i64* [[TMP13]], align 16
	// CHECK-NEXT: [[TMP14:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16
	// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP14]]
	//			//
	fixed_bool_t call_bool_ff(svbool_t pg, fixed_bool_t op1, fixed_bool_t op2) {			fixed_bool_t call_bool_ff(svbool_t pg, fixed_bool_t op1, fixed_bool_t op2) {
	return svsel(pg, op1, op2);			return svsel(pg, op1, op2);
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// fixed, scalable			// fixed, scalable
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
	// CHECK-LABEL: @call_bool_fs(			// CHECK-LABEL: @call_bool_fs(
	// CHECK-NEXT: entry:			// CHECK-NEXT: entry:
	// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: [[TMP0:%.]] = bitcast <8 x i8> [[OP1]] to <vscale x 16 x i1>*			// CHECK-NEXT: [[TMP0:%.]] = bitcast <8 x i8> [[OP1]] to <vscale x 16 x i1>*
	// CHECK-NEXT: store <vscale x 16 x i1> [[OP1_COERCE:%.]], <vscale x 16 x i1> [[TMP0]], align 16			// CHECK-NEXT: store <vscale x 16 x i1> [[OP1_COERCE:%.]], <vscale x 16 x i1> [[TMP0]], align 16
	// CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x i8> [[OP1]] to i64*			// CHECK-NEXT: [[OP11:%.]] = load <8 x i8>, <8 x i8> [[OP1]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[OP112:%.]] = load i64, i64 [[TMP1]], align 16, [[TBAA6]]			// CHECK-NEXT: store <8 x i8> [[OP11]], <8 x i8>* [[OP1_ADDR]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP2:%.]] = bitcast <8 x i8> [[OP1_ADDR]] to i64*			// CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x i8> [[OP1_ADDR]] to <vscale x 16 x i1>*
	// CHECK-NEXT: store i64 [[OP112]], i64* [[TMP2]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP2:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP1]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP3:%.]] = bitcast <8 x i8> [[OP1_ADDR]] to <vscale x 16 x i1>*			// CHECK-NEXT: [[TMP3:%.]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.]], <vscale x 16 x i1> [[TMP2]], <vscale x 16 x i1> [[OP2:%.*]])
	// CHECK-NEXT: [[TMP4:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP3]], align 16, [[TBAA6]]			// CHECK-NEXT: store <vscale x 16 x i1> [[TMP3]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, [[TBAA13]]
	// CHECK-NEXT: [[TMP5:%.]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.]], <vscale x 16 x i1> [[TMP4]], <vscale x 16 x i1> [[OP2:%.*]])			// CHECK-NEXT: [[CASTFIXEDSVE:%.]] = bitcast <vscale x 16 x i1> [[SAVED_CALL_RVALUE]] to <8 x i8>*
	// CHECK-NEXT: store <vscale x 16 x i1> [[TMP5]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, [[TBAA13]]			// CHECK-NEXT: [[TMP4:%.]] = load <8 x i8>, <8 x i8> [[CASTFIXEDSVE]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP6:%.]] = bitcast <vscale x 16 x i1> [[SAVED_CALL_RVALUE]] to i64*			// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to <8 x i8>*
	// CHECK-NEXT: [[TMP7:%.]] = load i64, i64 [[TMP6]], align 16, [[TBAA6]]			// CHECK-NEXT: store <8 x i8> [[TMP4]], <8 x i8>* [[RETVAL_0__SROA_CAST]], align 16
	// CHECK-NEXT: [[TMP8:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to i64*			// CHECK-NEXT: [[TMP5:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16
	// CHECK-NEXT: store i64 [[TMP7]], i64* [[TMP8]], align 16			// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP5]]
	// CHECK-NEXT: [[TMP9:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16
	// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP9]]
	//			//
	fixed_bool_t call_bool_fs(svbool_t pg, fixed_bool_t op1, svbool_t op2) {			fixed_bool_t call_bool_fs(svbool_t pg, fixed_bool_t op1, svbool_t op2) {
	return svsel(pg, op1, op2);			return svsel(pg, op1, op2);
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// scalable, scalable			// scalable, scalable
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	Show All 35 Lines
	}			}

	// CHECK-LABEL: @call_bool_ss(			// CHECK-LABEL: @call_bool_ss(
	// CHECK-NEXT: entry:			// CHECK-NEXT: entry:
	// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: [[TMP0:%.]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.]], <vscale x 16 x i1> [[OP1:%.]], <vscale x 16 x i1> [[OP2:%.]])			// CHECK-NEXT: [[TMP0:%.]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.]], <vscale x 16 x i1> [[OP1:%.]], <vscale x 16 x i1> [[OP2:%.]])
	// CHECK-NEXT: store <vscale x 16 x i1> [[TMP0]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, [[TBAA13]]			// CHECK-NEXT: store <vscale x 16 x i1> [[TMP0]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, [[TBAA13]]
	// CHECK-NEXT: [[TMP1:%.]] = bitcast <vscale x 16 x i1> [[SAVED_CALL_RVALUE]] to i64*			// CHECK-NEXT: [[CASTFIXEDSVE:%.]] = bitcast <vscale x 16 x i1> [[SAVED_CALL_RVALUE]] to <8 x i8>*
	// CHECK-NEXT: [[TMP2:%.]] = load i64, i64 [[TMP1]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP1:%.]] = load <8 x i8>, <8 x i8> [[CASTFIXEDSVE]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP3:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to i64*			// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to <8 x i8>*
	// CHECK-NEXT: store i64 [[TMP2]], i64* [[TMP3]], align 16			// CHECK-NEXT: store <8 x i8> [[TMP1]], <8 x i8>* [[RETVAL_0__SROA_CAST]], align 16
	// CHECK-NEXT: [[TMP4:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16			// CHECK-NEXT: [[TMP2:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16
	// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP4]]			// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP2]]
	//			//
	fixed_bool_t call_bool_ss(svbool_t pg, svbool_t op1, svbool_t op2) {			fixed_bool_t call_bool_ss(svbool_t pg, svbool_t op1, svbool_t op2) {
	return svsel(pg, op1, op2);			return svsel(pg, op1, op2);
	}			}

clang/test/CodeGen/attr-arm-sve-vector-bits-cast.c

	Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines
	}			}

	// CHECK-LABEL: @to_svbool_t(			// CHECK-LABEL: @to_svbool_t(
	// CHECK-NEXT: entry:			// CHECK-NEXT: entry:
	// CHECK-NEXT: [[TYPE:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[TYPE:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <8 x i8>, align 16			// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <8 x i8>, align 16
	// CHECK-NEXT: [[TMP0:%.]] = bitcast <8 x i8> [[TYPE]] to <vscale x 16 x i1>*			// CHECK-NEXT: [[TMP0:%.]] = bitcast <8 x i8> [[TYPE]] to <vscale x 16 x i1>*
	// CHECK-NEXT: store <vscale x 16 x i1> [[TYPE_COERCE:%.]], <vscale x 16 x i1> [[TMP0]], align 16			// CHECK-NEXT: store <vscale x 16 x i1> [[TYPE_COERCE:%.]], <vscale x 16 x i1> [[TMP0]], align 16
	// CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x i8> [[TYPE]] to i64*			// CHECK-NEXT: [[TYPE1:%.]] = load <8 x i8>, <8 x i8> [[TYPE]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TYPE12:%.]] = load i64, i64 [[TMP1]], align 16, [[TBAA6]]			// CHECK-NEXT: store <8 x i8> [[TYPE1]], <8 x i8>* [[TYPE_ADDR]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP2:%.]] = bitcast <8 x i8> [[TYPE_ADDR]] to i64*			// CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x i8> [[TYPE_ADDR]] to <vscale x 16 x i1>*
	// CHECK-NEXT: store i64 [[TYPE12]], i64* [[TMP2]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP2:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP1]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP3:%.]] = bitcast <8 x i8> [[TYPE_ADDR]] to <vscale x 16 x i1>*			// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP2]]
	// CHECK-NEXT: [[TMP4:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[TMP3]], align 16, [[TBAA6]]
	// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP4]]
	//			//
	svbool_t to_svbool_t(fixed_bool_t type) {			svbool_t to_svbool_t(fixed_bool_t type) {
	return type;			return type;
	}			}

	// CHECK-LABEL: @from_svbool_t(			// CHECK-LABEL: @from_svbool_t(
	// CHECK-NEXT: entry:			// CHECK-NEXT: entry:
	// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-NEXT: store <vscale x 16 x i1> [[TYPE:%.]], <vscale x 16 x i1> [[TYPE_ADDR]], align 16, [[TBAA13:!tbaa !.*]]			// CHECK-NEXT: store <vscale x 16 x i1> [[TYPE:%.]], <vscale x 16 x i1> [[TYPE_ADDR]], align 16, [[TBAA13:!tbaa !.*]]
	// CHECK-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[TYPE_ADDR]] to i64*			// CHECK-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[TYPE_ADDR]] to <8 x i8>*
	// CHECK-NEXT: [[TMP1:%.]] = load i64, i64 [[TMP0]], align 16, [[TBAA6]]			// CHECK-NEXT: [[TMP1:%.]] = load <8 x i8>, <8 x i8> [[TMP0]], align 16, [[TBAA6]]
	// CHECK-NEXT: [[TMP2:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to i64*			// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.]] = bitcast <vscale x 16 x i1> [[RETVAL_COERCE]] to <8 x i8>*
	// CHECK-NEXT: store i64 [[TMP1]], i64* [[TMP2]], align 16			// CHECK-NEXT: store <8 x i8> [[TMP1]], <8 x i8>* [[RETVAL_0__SROA_CAST]], align 16
	// CHECK-NEXT: [[TMP3:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16			// CHECK-NEXT: [[TMP2:%.]] = load <vscale x 16 x i1>, <vscale x 16 x i1> [[RETVAL_COERCE]], align 16
	// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP3]]			// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP2]]
	//			//
	fixed_bool_t from_svbool_t(svbool_t type) {			fixed_bool_t from_svbool_t(svbool_t type) {
	return type;			return type;
	}			}

	// CHECK-LABEL: @to_svint32_t__from_gnu_int32_t(			// CHECK-LABEL: @to_svint32_t__from_gnu_int32_t(
	// CHECK-NEXT: entry:			// CHECK-NEXT: entry:
	// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <16 x i32>, align 16			// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <16 x i32>, align 16
	▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

clang/test/CodeGen/attr-arm-sve-vector-bits-globals.c

	Show First 20 Lines • Show All 66 Lines • ▼ Show 20 Lines
	// CHECK-128-NEXT: [[TMP1:%.]] = load <2 x i8>, <2 x i8> [[TMP0]], align 16, [[TBAA10]]			// CHECK-128-NEXT: [[TMP1:%.]] = load <2 x i8>, <2 x i8> [[TMP0]], align 16, [[TBAA10]]
	// CHECK-128-NEXT: store <2 x i8> [[TMP1]], <2 x i8>* @global_bool, align 2, [[TBAA10]]			// CHECK-128-NEXT: store <2 x i8> [[TMP1]], <2 x i8>* @global_bool, align 2, [[TBAA10]]
	// CHECK-128-NEXT: ret void			// CHECK-128-NEXT: ret void
	//			//
	// CHECK-512-LABEL: @write_global_bool(			// CHECK-512-LABEL: @write_global_bool(
	// CHECK-512-NEXT: entry:			// CHECK-512-NEXT: entry:
	// CHECK-512-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16			// CHECK-512-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
	// CHECK-512-NEXT: store <vscale x 16 x i1> [[V:%.]], <vscale x 16 x i1> [[V_ADDR]], align 16, [[TBAA13:!tbaa !.*]]			// CHECK-512-NEXT: store <vscale x 16 x i1> [[V:%.]], <vscale x 16 x i1> [[V_ADDR]], align 16, [[TBAA13:!tbaa !.*]]
	// CHECK-512-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[V_ADDR]] to i64*			// CHECK-512-NEXT: [[TMP0:%.]] = bitcast <vscale x 16 x i1> [[V_ADDR]] to <8 x i8>*
	// CHECK-512-NEXT: [[TMP1:%.]] = load i64, i64 [[TMP0]], align 16, [[TBAA10]]			// CHECK-512-NEXT: [[TMP1:%.]] = load <8 x i8>, <8 x i8> [[TMP0]], align 16, [[TBAA10]]
	// CHECK-512-NEXT: store i64 [[TMP1]], i64* bitcast (<8 x i8>* @global_bool to i64*), align 2, [[TBAA10]]			// CHECK-512-NEXT: store <8 x i8> [[TMP1]], <8 x i8>* @global_bool, align 2, [[TBAA10]]
	// CHECK-512-NEXT: ret void			// CHECK-512-NEXT: ret void
	//			//
	void write_global_bool(svbool_t v) { global_bool = v; }			void write_global_bool(svbool_t v) { global_bool = v; }

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// READS			// READS
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	Show All 35 Lines

llvm/lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp

Show First 20 Lines • Show All 548 Lines • ▼ Show 20 Lines	static Instruction *combineLoadToOperationType(InstCombinerImpl &IC,

if (LI.use_empty())		if (LI.use_empty())
return nullptr;		return nullptr;

// swifterror values can't be bitcasted.		// swifterror values can't be bitcasted.
if (LI.getPointerOperand()->isSwiftError())		if (LI.getPointerOperand()->isSwiftError())
return nullptr;		return nullptr;

Type *Ty = LI.getType();
const DataLayout &DL = IC.getDataLayout();		const DataLayout &DL = IC.getDataLayout();

// Try to canonicalize loads which are only ever stored to operate over
// integers instead of any other type. We only do this when the loaded type
// is sized and has a size exactly the same as its store size and the store
// size is a legal integer type.
// Do not perform canonicalization if minmax pattern is found (to avoid
// infinite loop).
Type *Dummy;
if (!Ty->isIntegerTy() && Ty->isSized() && !isa<ScalableVectorType>(Ty) &&
DL.isLegalInteger(DL.getTypeStoreSizeInBits(Ty)) &&
DL.typeSizeEqualsStoreSize(Ty) && !DL.isNonIntegralPointerType(Ty) &&
!isMinMaxWithLoads(InstCombiner::peekThroughBitcast(
LI.getPointerOperand(), /OneUseOnly=/true),
Dummy)) {
if (all_of(LI.users(), [&LI](User *U) {
auto *SI = dyn_cast<StoreInst>(U);
return SI && SI->getPointerOperand() != &LI &&
!SI->getPointerOperand()->isSwiftError();
})) {
LoadInst *NewLoad = IC.combineLoadToNewType(
LI, Type::getIntNTy(LI.getContext(), DL.getTypeStoreSizeInBits(Ty)));
// Replace all the stores with stores of the newly loaded value.
for (auto UI = LI.user_begin(), UE = LI.user_end(); UI != UE;) {
auto SI = cast<StoreInst>(UI++);
IC.Builder.SetInsertPoint(SI);
combineStoreToNewValue(IC, *SI, NewLoad);
IC.eraseInstFromFunction(*SI);
}
assert(LI.use_empty() && "Failed to remove all users of the load!");
// Return the old load so the combiner can delete it safely.
return &LI;
}
}

// Fold away bit casts of the loaded value by loading the desired type.		// Fold away bit casts of the loaded value by loading the desired type.
// We can do this for BitCastInsts as well as casts from and to pointer types,		// We can do this for BitCastInsts as well as casts from and to pointer types,
// as long as those are noops (i.e., the source or dest type have the same		// as long as those are noops (i.e., the source or dest type have the same
// bitwidth as the target's pointers).		// bitwidth as the target's pointers).
if (LI.hasOneUse())		if (LI.hasOneUse())
if (auto* CI = dyn_cast<CastInst>(LI.user_back()))		if (auto* CI = dyn_cast<CastInst>(LI.user_back()))
if (CI->isNoopCast(DL))		if (CI->isNoopCast(DL))
if (!LI.isAtomic() \|\| isSupportedAtomicType(CI->getDestTy())) {		if (!LI.isAtomic() \|\| isSupportedAtomicType(CI->getDestTy())) {
▲ Show 20 Lines • Show All 954 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/atomic.ll

	Show First 20 Lines • Show All 319 Lines • ▼ Show 20 Lines
	merge:			merge:
	ret i32 0			ret i32 0
	}			}

	declare void @clobber()			declare void @clobber()

	define i32 @test18(float* %p) {			define i32 @test18(float* %p) {
	; CHECK-LABEL: @test18(			; CHECK-LABEL: @test18(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast float [[P:%.]] to i32			; CHECK-NEXT: [[X:%.]] = load atomic float, float [[P:%.*]] unordered, align 4
	; CHECK-NEXT: [[X1:%.]] = load atomic i32, i32 [[TMP1]] unordered, align 4
	; CHECK-NEXT: call void @clobber()			; CHECK-NEXT: call void @clobber()
	; CHECK-NEXT: [[TMP2:%.]] = bitcast float [[P]] to i32*			; CHECK-NEXT: store atomic float [[X]], float* [[P]] unordered, align 4
	; CHECK-NEXT: store atomic i32 [[X1]], i32* [[TMP2]] unordered, align 4
	; CHECK-NEXT: ret i32 0			; CHECK-NEXT: ret i32 0
	;			;
	%x = load atomic float, float* %p unordered, align 4			%x = load atomic float, float* %p unordered, align 4
	call void @clobber() ;; keep the load around			call void @clobber() ;; keep the load around
	store atomic float %x, float* %p unordered, align 4			store atomic float %x, float* %p unordered, align 4
	ret i32 0			ret i32 0
	}			}

	Show All 30 Lines
	;			;
	%cast = bitcast i8* %v to i32*			%cast = bitcast i8* %v to i32*
	store atomic i32* %cast, i32** %p monotonic, align 4			store atomic i32* %cast, i32** %p monotonic, align 4
	ret i32 0			ret i32 0
	}			}

	define void @pr27490a(i8 %p1, i8 %p2) {			define void @pr27490a(i8 %p1, i8 %p2) {
	; CHECK-LABEL: @pr27490a(			; CHECK-LABEL: @pr27490a(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast i8* [[P1:%.]] to i64			; CHECK-NEXT: [[L:%.]] = load i8, i8** [[P1:%.*]], align 8
	; CHECK-NEXT: [[L1:%.]] = load i64, i64 [[TMP1]], align 8			; CHECK-NEXT: store volatile i8* [[L]], i8** [[P2:%.*]], align 8
	; CHECK-NEXT: [[TMP2:%.]] = bitcast i8* [[P2:%.]] to i64
	; CHECK-NEXT: store volatile i64 [[L1]], i64* [[TMP2]], align 8
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	%l = load i8, i8* %p1			%l = load i8, i8* %p1
	store volatile i8* %l, i8** %p2			store volatile i8* %l, i8** %p2
	ret void			ret void
	}			}

	define void @pr27490b(i8 %p1, i8 %p2) {			define void @pr27490b(i8 %p1, i8 %p2) {
	; CHECK-LABEL: @pr27490b(			; CHECK-LABEL: @pr27490b(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast i8* [[P1:%.]] to i64			; CHECK-NEXT: [[L:%.]] = load i8, i8** [[P1:%.*]], align 8
	; CHECK-NEXT: [[L1:%.]] = load i64, i64 [[TMP1]], align 8			; CHECK-NEXT: store atomic i8* [[L]], i8** [[P2:%.*]] seq_cst, align 8
	; CHECK-NEXT: [[TMP2:%.]] = bitcast i8* [[P2:%.]] to i64
	; CHECK-NEXT: store atomic i64 [[L1]], i64* [[TMP2]] seq_cst, align 8
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	%l = load i8, i8* %p1			%l = load i8, i8* %p1
	store atomic i8* %l, i8** %p2 seq_cst, align 8			store atomic i8* %l, i8** %p2 seq_cst, align 8
	ret void			ret void
	}			}

	;; At the moment, we can't form atomic vectors by folding since these are			;; At the moment, we can't form atomic vectors by folding since these are
	Show All 27 Lines

llvm/test/Transforms/InstCombine/load.ll

	Show First 20 Lines • Show All 199 Lines • ▼ Show 20 Lines
	}			}

	; Check that we canonicalize loads which are only stored to use integer types			; Check that we canonicalize loads which are only stored to use integer types
	; when there is a valid integer type.			; when there is a valid integer type.

	define void @test16(i8* %x, i8* %a, i8* %b, i8* %c) {			define void @test16(i8* %x, i8* %a, i8* %b, i8* %c) {
	; CHECK-LABEL: @test16(			; CHECK-LABEL: @test16(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = bitcast i8 [[X:%.]] to i32			; CHECK-NEXT: [[X_CAST:%.]] = bitcast i8 [[X:%.]] to float
	; CHECK-NEXT: [[X11:%.]] = load i32, i32 [[TMP0]], align 4			; CHECK-NEXT: [[A_CAST:%.]] = bitcast i8 [[A:%.]] to float
	; CHECK-NEXT: [[TMP1:%.]] = bitcast i8 [[A:%.]] to i32			; CHECK-NEXT: [[B_CAST:%.]] = bitcast i8 [[B:%.]] to float
	; CHECK-NEXT: store i32 [[X11]], i32* [[TMP1]], align 4			; CHECK-NEXT: [[X1:%.]] = load float, float [[X_CAST]], align 4
	; CHECK-NEXT: [[TMP2:%.]] = bitcast i8 [[B:%.]] to i32			; CHECK-NEXT: store float [[X1]], float* [[A_CAST]], align 4
	; CHECK-NEXT: store i32 [[X11]], i32* [[TMP2]], align 4			; CHECK-NEXT: store float [[X1]], float* [[B_CAST]], align 4
	; CHECK-NEXT: [[TMP3:%.]] = bitcast i8 [[X]] to i32*			; CHECK-NEXT: [[X2:%.]] = load float, float [[X_CAST]], align 4
	; CHECK-NEXT: [[X22:%.]] = load i32, i32 [[TMP3]], align 4			; CHECK-NEXT: store float [[X2]], float* [[B_CAST]], align 4
	; CHECK-NEXT: [[TMP4:%.]] = bitcast i8 [[B]] to i32*			; CHECK-NEXT: [[TMP0:%.]] = bitcast i8 [[C:%.]] to float
	; CHECK-NEXT: store i32 [[X22]], i32* [[TMP4]], align 4			; CHECK-NEXT: store float [[X2]], float* [[TMP0]], align 4
	; CHECK-NEXT: [[TMP5:%.]] = bitcast i8 [[C:%.]] to i32
	; CHECK-NEXT: store i32 [[X22]], i32* [[TMP5]], align 4
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%x.cast = bitcast i8* %x to float*			%x.cast = bitcast i8* %x to float*
	%a.cast = bitcast i8* %a to float*			%a.cast = bitcast i8* %a to float*
	%b.cast = bitcast i8* %b to float*			%b.cast = bitcast i8* %b to float*
	%c.cast = bitcast i8* %c to i32*			%c.cast = bitcast i8* %c to i32*

	%x1 = load float, float* %x.cast			%x1 = load float, float* %x.cast
	store float %x1, float* %a.cast			store float %x1, float* %a.cast
	store float %x1, float* %b.cast			store float %x1, float* %b.cast

	%x2 = load float, float* %x.cast			%x2 = load float, float* %x.cast
	store float %x2, float* %b.cast			store float %x2, float* %b.cast
	%x2.cast = bitcast float %x2 to i32			%x2.cast = bitcast float %x2 to i32
	store i32 %x2.cast, i32* %c.cast			store i32 %x2.cast, i32* %c.cast

	ret void			ret void
	}			}

	define void @test16-vect(i8* %x, i8* %a, i8* %b, i8* %c) {			define void @test16-vect(i8* %x, i8* %a, i8* %b, i8* %c) {
	; CHECK-LABEL: @test16-vect(			; CHECK-LABEL: @test16-vect(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = bitcast i8 [[X:%.]] to i32			; CHECK-NEXT: [[X_CAST:%.]] = bitcast i8 [[X:%.]] to <4 x i8>
	; CHECK-NEXT: [[X11:%.]] = load i32, i32 [[TMP0]], align 4			; CHECK-NEXT: [[A_CAST:%.]] = bitcast i8 [[A:%.]] to <4 x i8>
	; CHECK-NEXT: [[TMP1:%.]] = bitcast i8 [[A:%.]] to i32			; CHECK-NEXT: [[B_CAST:%.]] = bitcast i8 [[B:%.]] to <4 x i8>
	; CHECK-NEXT: store i32 [[X11]], i32* [[TMP1]], align 4			; CHECK-NEXT: [[X1:%.]] = load <4 x i8>, <4 x i8> [[X_CAST]], align 4
	; CHECK-NEXT: [[TMP2:%.]] = bitcast i8 [[B:%.]] to i32			; CHECK-NEXT: store <4 x i8> [[X1]], <4 x i8>* [[A_CAST]], align 4
	; CHECK-NEXT: store i32 [[X11]], i32* [[TMP2]], align 4			; CHECK-NEXT: store <4 x i8> [[X1]], <4 x i8>* [[B_CAST]], align 4
	; CHECK-NEXT: [[TMP3:%.]] = bitcast i8 [[X]] to i32*			; CHECK-NEXT: [[X2:%.]] = load <4 x i8>, <4 x i8> [[X_CAST]], align 4
	; CHECK-NEXT: [[X22:%.]] = load i32, i32 [[TMP3]], align 4			; CHECK-NEXT: store <4 x i8> [[X2]], <4 x i8>* [[B_CAST]], align 4
	; CHECK-NEXT: [[TMP4:%.]] = bitcast i8 [[B]] to i32*			; CHECK-NEXT: [[TMP0:%.]] = bitcast i8 [[C:%.]] to <4 x i8>
	; CHECK-NEXT: store i32 [[X22]], i32* [[TMP4]], align 4			; CHECK-NEXT: store <4 x i8> [[X2]], <4 x i8>* [[TMP0]], align 4
	; CHECK-NEXT: [[TMP5:%.]] = bitcast i8 [[C:%.]] to i32
	; CHECK-NEXT: store i32 [[X22]], i32* [[TMP5]], align 4
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%x.cast = bitcast i8* %x to <4 x i8>*			%x.cast = bitcast i8* %x to <4 x i8>*
	%a.cast = bitcast i8* %a to <4 x i8>*			%a.cast = bitcast i8* %a to <4 x i8>*
	%b.cast = bitcast i8* %b to <4 x i8>*			%b.cast = bitcast i8* %b to <4 x i8>*
	%c.cast = bitcast i8* %c to i32*			%c.cast = bitcast i8* %c to i32*

	▲ Show 20 Lines • Show All 88 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/loadstore-metadata.ll

Show First 20 Lines • Show All 155 Lines • ▼ Show 20 Lines	loop:
%cmp = icmp slt i32 %i.next, %n		%cmp = icmp slt i32 %i.next, %n
br i1 %cmp, label %loop, label %exit, !llvm.loop !1		br i1 %cmp, label %loop, label %exit, !llvm.loop !1

exit:		exit:
ret void		ret void
}		}

define void @test_load_cast_combine_nonnull(float** %ptr) {		define void @test_load_cast_combine_nonnull(float** %ptr) {
; We can't preserve nonnull metadata when converting a load of a pointer to
; a load of an integer. Instead, we translate it to range metadata.
; FIXME: We should also transform range metadata back into nonnull metadata.
; FIXME: This test is very fragile. If any LABEL lines are added after
; this point, the test will fail, because this test depends on a metadata tuple,
; which is always emitted at the end of the file. At some point, we should
; consider an option to the IR printer to emit MD tuples after the function
; that first uses them--this will allow us to refer to them like this and not
; have the tests break. For now, this function must always come last in this
; file, and no LABEL lines are to be added after this point.
;
; CHECK-LABEL: @test_load_cast_combine_nonnull(		; CHECK-LABEL: @test_load_cast_combine_nonnull(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = bitcast float* [[PTR:%.]] to i64		; CHECK-NEXT: [[P:%.]] = load float, float** [[PTR:%.*]], align 8, !nonnull !7
; CHECK-NEXT: [[P1:%.]] = load i64, i64 [[TMP0]], align 8, !range ![[MD:[0-9]+]]
; CHECK-NEXT: [[GEP:%.]] = getelementptr float, float** [[PTR]], i64 42		; CHECK-NEXT: [[GEP:%.]] = getelementptr float, float** [[PTR]], i64 42
; CHECK-NEXT: [[TMP1:%.]] = bitcast float* [[GEP]] to i64*		; CHECK-NEXT: store float* [[P]], float** [[GEP]], align 8
; CHECK-NEXT: store i64 [[P1]], i64* [[TMP1]], align 8
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
entry:		entry:
%p = load float, float* %ptr, !nonnull !6		%p = load float, float* %ptr, !nonnull !6
%gep = getelementptr float, float* %ptr, i32 42		%gep = getelementptr float, float* %ptr, i32 42
store float* %p, float** %gep		store float* %p, float** %gep
ret void		ret void
}		}

; This is the metadata tuple that we reference above:
; CHECK: ![[MD]] = !{i64 1, i64 0}
!0 = !{!1, !1, i64 0}		!0 = !{!1, !1, i64 0}
!1 = !{!"scalar type", !2}		!1 = !{!"scalar type", !2}
!2 = !{!"root"}		!2 = !{!"root"}
!3 = distinct !{!3, !4}		!3 = distinct !{!3, !4}
!4 = distinct !{!4, !{!"llvm.loop.parallel_accesses", !9}}		!4 = distinct !{!4, !{!"llvm.loop.parallel_accesses", !9}}
!5 = !{i32 0, i32 42}		!5 = !{i32 0, i32 42}
!6 = !{}		!6 = !{}
!7 = !{i32 1}		!7 = !{i32 1}
!8 = !{i64 8}		!8 = !{i64 8}
!9 = distinct !{}		!9 = distinct !{}

llvm/test/Transforms/InstCombine/non-integral-pointers.ll

Show All 35 Lines	entry:
ret void		ret void
}		}

define void @f_3(i8 addrspace(3) %ptr0, i8 addrspace(3) %ptr1) {		define void @f_3(i8 addrspace(3) %ptr0, i8 addrspace(3) %ptr1) {
; It is okay to convert the load/store pair to load and store		; It is okay to convert the load/store pair to load and store
; integers, since pointers in address space 3 are integral.		; integers, since pointers in address space 3 are integral.
; CHECK-LABEL: @f_3(		; CHECK-LABEL: @f_3(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = bitcast i8 addrspace(3)* [[PTR0:%.]] to i64		; CHECK-NEXT: [[VAL:%.]] = load i8 addrspace(3), i8 addrspace(3)** [[PTR0:%.*]], align 8
; CHECK-NEXT: [[VAL1:%.]] = load i64, i64 [[TMP0]], align 8		; CHECK-NEXT: store i8 addrspace(3)* [[VAL]], i8 addrspace(3)** [[PTR1:%.*]], align 8
; CHECK-NEXT: [[TMP1:%.]] = bitcast i8 addrspace(3)* [[PTR1:%.]] to i64
; CHECK-NEXT: store i64 [[VAL1]], i64* [[TMP1]], align 8
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
entry:		entry:
%val = load i8 addrspace(3), i8 addrspace(3)* %ptr0		%val = load i8 addrspace(3), i8 addrspace(3)* %ptr0
store i8 addrspace(3)* %val, i8 addrspace(3)** %ptr1		store i8 addrspace(3)* %val, i8 addrspace(3)** %ptr1
ret void		ret void
}		}

Show All 18 Lines	;
store i8 addrspace(4)* %.pre, i8 addrspace(4)** %v77, align 8		store i8 addrspace(4)* %.pre, i8 addrspace(4)** %v77, align 8
%v80 = bitcast i8 addrspace(4)** %v77 to i64*		%v80 = bitcast i8 addrspace(4)** %v77 to i64*
%v81 = load i64, i64* %v80, align 8		%v81 = load i64, i64* %v80, align 8
ret i64 %v81		ret i64 %v81
}		}

define i64 @g2(i8* addrspace(4)* %gp) {		define i64 @g2(i8* addrspace(4)* %gp) {
; CHECK-LABEL: @g2(		; CHECK-LABEL: @g2(
; CHECK-NEXT: [[TMP1:%.]] = bitcast i8 addrspace(4)* [[GP:%.]] to i64 addrspace(4)		; CHECK-NEXT: [[DOTPRE:%.]] = load i8, i8* addrspace(4)* [[GP:%.*]], align 8
; CHECK-NEXT: [[DOTPRE1:%.]] = load i64, i64 addrspace(4) [[TMP1]], align 8
; CHECK-NEXT: [[V74:%.]] = call i8 addrspace(4) @alloc()		; CHECK-NEXT: [[V74:%.]] = call i8 addrspace(4) @alloc()
; CHECK-NEXT: [[V77:%.]] = getelementptr i8, i8 addrspace(4) [[V74]], i64 -8		; CHECK-NEXT: [[V77:%.]] = getelementptr i8, i8 addrspace(4) [[V74]], i64 -8
; CHECK-NEXT: [[TMP2:%.]] = bitcast i8 addrspace(4) [[V77]] to i64 addrspace(4)*		; CHECK-NEXT: [[TMP1:%.]] = bitcast i8 addrspace(4) [[V77]] to i8* addrspace(4)*
; CHECK-NEXT: store i64 [[DOTPRE1]], i64 addrspace(4)* [[TMP2]], align 8		; CHECK-NEXT: store i8* [[DOTPRE]], i8* addrspace(4)* [[TMP1]], align 8
; CHECK-NEXT: ret i64 [[DOTPRE1]]		; CHECK-NEXT: [[V81_CAST:%.]] = ptrtoint i8 [[DOTPRE]] to i64
		; CHECK-NEXT: ret i64 [[V81_CAST]]
;		;
%.pre = load i8, i8 addrspace(4)* %gp, align 8		%.pre = load i8, i8 addrspace(4)* %gp, align 8
%v74 = call i8 addrspace(4)* @alloc()		%v74 = call i8 addrspace(4)* @alloc()
%v76 = bitcast i8 addrspace(4)* %v74 to i8* addrspace(4)*		%v76 = bitcast i8 addrspace(4)* %v74 to i8* addrspace(4)*
%v77 = getelementptr i8, i8 addrspace(4)* %v76, i64 -1		%v77 = getelementptr i8, i8 addrspace(4)* %v76, i64 -1
store i8* %.pre, i8* addrspace(4)* %v77, align 8		store i8* %.pre, i8* addrspace(4)* %v77, align 8
%v80 = bitcast i8* addrspace(4)* %v77 to i64 addrspace(4)*		%v80 = bitcast i8* addrspace(4)* %v77 to i64 addrspace(4)*
%v81 = load i64, i64 addrspace(4)* %v80, align 8		%v81 = load i64, i64 addrspace(4)* %v80, align 8
Show All 16 Lines

llvm/test/Transforms/PhaseOrdering/instcombine-sroa-inttoptr.ll

	Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines

	target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"

	%0 = type { i32*, i32, i32, i32 }			%0 = type { i32*, i32, i32, i32 }

	define dso_local void @_Z3gen1S(%0* noalias sret align 8 %arg, %0* byval(%0) align 8 %arg1) {			define dso_local void @_Z3gen1S(%0* noalias sret align 8 %arg, %0* byval(%0) align 8 %arg1) {
	; CHECK-LABEL: @_Z3gen1S(			; CHECK-LABEL: @_Z3gen1S(
	; CHECK-NEXT: bb:			; CHECK-NEXT: bb:
	; CHECK-NEXT: [[TMP0:%.]] = bitcast %0 [[ARG1:%.]] to i64			; CHECK-NEXT: [[I:%.]] = getelementptr inbounds [[TMP0:%.]], %0* [[ARG1:%.*]], i64 0, i32 0
	; CHECK-NEXT: [[I21:%.]] = load i64, i64 [[TMP0]], align 8			; CHECK-NEXT: [[I2:%.]] = load i32, i32** [[I]], align 8
	; CHECK-NEXT: [[TMP1:%.]] = bitcast %0 [[ARG:%.]] to i64			; CHECK-NEXT: [[I3:%.]] = getelementptr inbounds [[TMP0]], %0 [[ARG:%.*]], i64 0, i32 0
	; CHECK-NEXT: store i64 [[I21]], i64* [[TMP1]], align 8			; CHECK-NEXT: store i32* [[I2]], i32** [[I3]], align 8
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	bb:			bb:
	%i = getelementptr inbounds %0, %0* %arg1, i32 0, i32 0			%i = getelementptr inbounds %0, %0* %arg1, i32 0, i32 0
	%i2 = load i32, i32* %i, align 8			%i2 = load i32, i32* %i, align 8
	%i3 = getelementptr inbounds %0, %0* %arg, i32 0, i32 0			%i3 = getelementptr inbounds %0, %0* %arg, i32 0, i32 0
	store i32* %i2, i32** %i3, align 8			store i32* %i2, i32** %i3, align 8
	ret void			ret void
	}			}

	define dso_local i32* @_Z3foo1S(%0* byval(%0) align 8 %arg) {			define dso_local i32* @_Z3foo1S(%0* byval(%0) align 8 %arg) {
	; CHECK-LABEL: @_Z3foo1S(			; CHECK-LABEL: @_Z3foo1S(
	; CHECK-NEXT: bb:			; CHECK-NEXT: bb:
	; CHECK-NEXT: [[I2:%.]] = alloca [[TMP0:%.]], align 8			; CHECK-NEXT: [[I2:%.]] = alloca [[TMP0:%.]], align 8
	; CHECK-NEXT: [[I1_SROA_0_0_I5_SROA_CAST:%.]] = bitcast %0 [[ARG:%.]] to i64			; CHECK-NEXT: [[I1_SROA_0_0_I5_SROA_IDX:%.]] = getelementptr inbounds [[TMP0]], %0 [[ARG:%.*]], i64 0, i32 0
	; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i64, i64 [[I1_SROA_0_0_I5_SROA_CAST]], align 8			; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i32, i32** [[I1_SROA_0_0_I5_SROA_IDX]], align 8
	; CHECK-NEXT: [[I_SROA_0_0_I6_SROA_CAST:%.]] = bitcast %0 [[I2]] to i64*			; CHECK-NEXT: [[I_SROA_0_0_I6_SROA_IDX:%.]] = getelementptr inbounds [[TMP0]], %0 [[I2]], i64 0, i32 0
	; CHECK-NEXT: store i64 [[I1_SROA_0_0_COPYLOAD]], i64* [[I_SROA_0_0_I6_SROA_CAST]], align 8			; CHECK-NEXT: store i32* [[I1_SROA_0_0_COPYLOAD]], i32** [[I_SROA_0_0_I6_SROA_IDX]], align 8
	; CHECK-NEXT: tail call void @_Z7escape01S(%0* nonnull byval(%0) align 8 [[I2]])			; CHECK-NEXT: tail call void @_Z7escape01S(%0* nonnull byval(%0) align 8 [[I2]])
	; CHECK-NEXT: [[TMP0]] = inttoptr i64 [[I1_SROA_0_0_COPYLOAD]] to i32*			; CHECK-NEXT: ret i32* [[I1_SROA_0_0_COPYLOAD]]
	; CHECK-NEXT: ret i32* [[TMP0]]
	;			;
	bb:			bb:
	%i = alloca %0, align 8			%i = alloca %0, align 8
	%i1 = alloca %0, align 8			%i1 = alloca %0, align 8
	%i2 = alloca %0, align 8			%i2 = alloca %0, align 8
	%i3 = bitcast %0* %i to i8*			%i3 = bitcast %0* %i to i8*
	call void @llvm.lifetime.start.p0i8(i64 24, i8* %i3)			call void @llvm.lifetime.start.p0i8(i64 24, i8* %i3)
	%i4 = bitcast %0* %i1 to i8*			%i4 = bitcast %0* %i1 to i8*
	Show All 17 Lines

	declare dso_local void @_Z7escape01S(%0* byval(%0) align 8)			declare dso_local void @_Z7escape01S(%0* byval(%0) align 8)

	declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture)			declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture)

	define dso_local i32* @_Z3bar1S(%0* byval(%0) align 8 %arg) {			define dso_local i32* @_Z3bar1S(%0* byval(%0) align 8 %arg) {
	; CHECK-LABEL: @_Z3bar1S(			; CHECK-LABEL: @_Z3bar1S(
	; CHECK-NEXT: bb:			; CHECK-NEXT: bb:
	; CHECK-NEXT: [[I1_SROA_0_0_I4_SROA_CAST:%.]] = bitcast %0 [[ARG:%.]] to i64			; CHECK-NEXT: [[I1_SROA_0_0_I4_SROA_IDX:%.]] = getelementptr inbounds [[TMP0:%.]], %0* [[ARG:%.*]], i64 0, i32 0
	; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i64, i64 [[I1_SROA_0_0_I4_SROA_CAST]], align 8			; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i32, i32** [[I1_SROA_0_0_I4_SROA_IDX]], align 8
	; CHECK-NEXT: [[I5:%.*]] = tail call i32 @_Z4condv()			; CHECK-NEXT: [[I5:%.*]] = tail call i32 @_Z4condv()
	; CHECK-NEXT: [[I6_NOT:%.*]] = icmp eq i32 [[I5]], 0			; CHECK-NEXT: [[I6_NOT:%.*]] = icmp eq i32 [[I5]], 0
	; CHECK-NEXT: br i1 [[I6_NOT]], label [[BB10:%.]], label [[BB7:%.]]			; CHECK-NEXT: br i1 [[I6_NOT]], label [[BB10:%.]], label [[BB7:%.]]
	; CHECK: bb7:			; CHECK: bb7:
	; CHECK-NEXT: tail call void @_Z5sync0v()			; CHECK-NEXT: tail call void @_Z5sync0v()
	; CHECK-NEXT: [[TMP0:%.]] = inttoptr i64 [[I1_SROA_0_0_COPYLOAD]] to i32			; CHECK-NEXT: tail call void @_Z7escape0Pi(i32* [[I1_SROA_0_0_COPYLOAD]])
	; CHECK-NEXT: tail call void @_Z7escape0Pi(i32* [[TMP0]])
	; CHECK-NEXT: br label [[BB13:%.*]]			; CHECK-NEXT: br label [[BB13:%.*]]
	; CHECK: bb10:			; CHECK: bb10:
	; CHECK-NEXT: tail call void @_Z5sync1v()			; CHECK-NEXT: tail call void @_Z5sync1v()
	; CHECK-NEXT: [[TMP1:%.]] = inttoptr i64 [[I1_SROA_0_0_COPYLOAD]] to i32			; CHECK-NEXT: tail call void @_Z7escape1Pi(i32* [[I1_SROA_0_0_COPYLOAD]])
	; CHECK-NEXT: tail call void @_Z7escape1Pi(i32* [[TMP1]])
	; CHECK-NEXT: br label [[BB13]]			; CHECK-NEXT: br label [[BB13]]
	; CHECK: bb13:			; CHECK: bb13:
	; CHECK-NEXT: [[DOTPRE_PHI:%.]] = phi i32 [ [[TMP1]], [[BB10]] ], [ [[TMP0]], [[BB7]] ]			; CHECK-NEXT: ret i32* [[I1_SROA_0_0_COPYLOAD]]
	; CHECK-NEXT: ret i32* [[DOTPRE_PHI]]
	;			;
	bb:			bb:
	%i = alloca %0, align 8			%i = alloca %0, align 8
	%i1 = alloca %0, align 8			%i1 = alloca %0, align 8
	%i2 = bitcast %0* %i to i8*			%i2 = bitcast %0* %i to i8*
	call void @llvm.lifetime.start.p0i8(i64 24, i8* %i2)			call void @llvm.lifetime.start.p0i8(i64 24, i8* %i2)
	%i3 = bitcast %0* %i1 to i8*			%i3 = bitcast %0* %i1 to i8*
	%i4 = bitcast %0* %arg to i8*			%i4 = bitcast %0* %arg to i8*
	Show All 33 Lines