
Details

Reviewers

craig.topper
spatel
andreadb
wristow

Commits

rG4f799c027e09: [X86] Mark all byval parameters as aliased
rL331749: [X86] Mark all byval parameters as aliased

Summary

The formal LLVM IR Value * passed to the function is guaranteed to alias this stack slot, so later optimisations that reschedule accesses to the slot must treat them as potentially aliased; without that, the reordering is unsafe. See: PR30290

Diff Detail

Event Timeline

jmorse created this revision. Mar 29 2018, 6:09 AM

Herald added a subscriber: llvm-commits. Mar 29 2018, 6:09 AM

Test?

Hi, I'm mostly sharing this to make clear in https://bugs.llvm.org/show_bug.cgi?id=30290 what the solution may be. I figured I'd add reviewers if it was agreed to be a fix, but if this kind of discussion-patch should go somewhere else, do let me know.

Added a regression test and craig.topper as reviewer. Hi Craig, this is a fix for https://bugs.llvm.org/show_bug.cgi?id=30290, which has gone quiet. The tl;dr is that without isAliased=true for byval argument stack slots, the instruction scheduler doesn't know that the byval stack slot is aliased by an IR Value, and can illegally re-order frameindex-based and Value-based memory accesses of the same location, as demonstrated in the PR and in the added regression test.
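To make the hazard described above concrete, here is a minimal sketch in self-contained C++. It is a toy model, not LLVM's scheduler or alias analysis: `Base`, `MemAccess`, `FrameModel` and `byvalStoreLoadCanSwap` are hypothetical names invented for illustration. The only idea taken from the discussion is that an access naming a stack slot by frame index and an access going through an IR Value are treated as independent unless the slot is flagged as aliased.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy model only: these types are hypothetical and are not LLVM classes.
enum class Base { FrameIndex, IRValue };

struct MemAccess {
  Base base;         // how the access names its memory
  std::size_t slot;  // stack slot index; meaningful only for Base::FrameIndex
  bool isStore;
};

struct FrameModel {
  std::vector<bool> aliased;  // aliased[slot]: may an IR Value point at the slot?

  bool mayAlias(const MemAccess &a, const MemAccess &b) const {
    if (a.base == Base::FrameIndex && b.base == Base::FrameIndex)
      return a.slot == b.slot;  // two named slots overlap only if identical
    if (a.base == Base::IRValue && b.base == Base::IRValue)
      return true;  // no extra information in this sketch: stay conservative
    // Mixed case: the answer depends entirely on the slot's aliased flag.
    const MemAccess &fi = (a.base == Base::FrameIndex) ? a : b;
    assert(fi.slot < aliased.size());
    return aliased[fi.slot];
  }

  // Two accesses may be swapped unless one is a store and they may alias.
  bool canReorder(const MemAccess &first, const MemAccess &second) const {
    if (!first.isStore && !second.isStore)
      return true;
    return !mayAlias(first, second);
  }
};

// The shape of the reported problem: a store through the IR Value followed by
// a load of the same byval slot by frame index. Returns whether this toy
// model would allow the two accesses to be swapped.
bool byvalStoreLoadCanSwap(bool slotIsAliased) {
  FrameModel frame;
  frame.aliased.push_back(slotIsAliased);
  MemAccess storeViaValue{Base::IRValue, 0, true};
  MemAccess loadViaSlot{Base::FrameIndex, 0, false};
  return frame.canReorder(storeViaValue, loadViaSlot);
}
```

In this model, `byvalStoreLoadCanSwap(false)` permits the swap (the miscompile shape), while `byvalStoreLoadCanSwap(true)` forbids it, which is the conservative behaviour the patch asks for.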

craig.topper added a reviewer: spatel. Apr 10 2018, 8:13 AM

spatel added reviewers: andreadb, wristow. Apr 10 2018, 11:47 AM

wristow added inline comments. Apr 24 2018, 6:51 PM

lib/Target/X86/X86ISelLowering.cpp
2836	The code-change itself looks safe to me, and my understanding of the discussion in http://llvm.org/PR30290 is that this approach is conservative in terms of the aliasing that it will mark (but that identifying the precisely-minimal aliasing will require a very large amount of restructuring). My guess is that this conservative aliasing won't have a serious impact on overall code quality.

Like the FIXME comment above (line 2829), I think a similar comment would be good. Something like:

    // FIXME: For now, all byval parameter objects are marked as aliasing. This
    // can be improved with deeper analysis.

But I have to say I don't have a deep understanding of the area, so I'm curious to hear what others think.

Formatting nit: With the added `true` parameter, the line is more than 80 characters. Also, I'd prefer a comment indicating what the parameter being set to `true` is doing:

    int FI = MFI.CreateFixedObject(Bytes, VA.getLocMemOffset(), isImmutable,
                                   /*isAliased=*/true);
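As a small illustration of the argument-comment suggestion, the sketch below uses a hypothetical `MockFrameInfo` that only mirrors the argument order visible in the diff; it is not LLVM's `MachineFrameInfo`, and the `false` default for the trailing flag is an assumption of the mock. The point is just that a bare trailing `true` is hard to read at the call site, while `/*isAliased=*/true` documents itself.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical mock; it only mirrors the argument order visible in the diff.
struct MockFrameInfo {
  struct Object {
    std::uint64_t size;
    std::int64_t spOffset;
    bool isImmutable;
    bool isAliased;
  };
  std::vector<Object> objects;

  // The trailing flag defaults to false in this mock (assumption).
  int createFixedObject(std::uint64_t size, std::int64_t spOffset,
                        bool isImmutable, bool isAliased = false) {
    objects.push_back(Object{size, spOffset, isImmutable, isAliased});
    return static_cast<int>(objects.size()) - 1;
  }
};

// Mirrors the byval branch of the patch, written against the mock above.
int lowerByValSlot(MockFrameInfo &mfi, std::uint64_t bytes,
                   std::int64_t spOffset, bool isImmutable) {
  if (bytes == 0)
    bytes = 1;  // Don't create zero-sized stack objects.
  int fi = mfi.createFixedObject(bytes, spOffset, isImmutable,
                                 /*isAliased=*/true);
  assert(mfi.objects[static_cast<std::size_t>(fi)].isAliased);
  return fi;
}
```

Calling `lowerByValSlot(mfi, 0, 16, false)` on an empty mock records a one-byte object whose aliased flag is set, regardless of the mutability flag passed in.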

Could you please add an llc test instead of an lli test?

Updated with comment + cosmetics to the CreateFixedObject call, changed test to be an llc codegen test.

A couple minor test-tweak comments in-line.
As before, the code-change looks safe and conservative to me, in terms of the aliasing. And my guess is that conservative aspect is fine, in that it won't seriously impact performance. So I'm happy to say LGTM. Does anyone with more experience in this area have any concerns?

test/CodeGen/X86/pr30290.ll
3	'bar' or 'baz'?
20	Remove this XXX line?

In D45022#1084793, @wristow wrote:

A couple minor test-tweak comments in-line.
As before, the code-change looks safe and conservative to me, in terms of the aliasing. And my guess is that conservative aspect is fine, in that it won't seriously impact performance. So I'm happy to say LGTM. Does anyone with more experience in this area have any concerns?

I don't have any better suggestions about the code change, but let me chime in with a general comment about the test: it would be better to commit the test in trunk without this code change, so we have the baseline (miscompile). That way, if there are any temptations to revert the code change because of a perf problem, it will be clear that we would be reintroducing a miscompile.

Also, please use utils/update_mir_test_checks.py or utils/update_llc_test_checks.py to generate the CHECK lines. That's better than adding assertions by hand in almost all cases.

Improves comments in test description, use utils/update_llc_test_checks.py to generate CHECK lines in test.

@spatel just to confirm I understand you correctly, you're saying to commit the test first (where it would fail) followed by the code change that fixes it, to discourage unconsidered reversion, yes?

In D45022#1086594, @jmorse wrote:

Improves comments in test description, use utils/update_llc_test_checks.py to generate CHECK lines in test.

@spatel just to confirm I understand you correctly, you're saying to commit the test first (where it would fail) followed by the code change that fixes it, to discourage unconsidered reversion, yes?

Correct. You can commit the test right now (with auto-generated CHECK lines that show the bug). Add a FIXME comment too to be extra clear that the output shown is a miscompile.
After that commit, apply your code fix locally, rebuild llc, and regenerate the CHECK lines using the script. There will only be a couple of lines changing in the test file now. Upload the complete rebased diff here. Then, we'll have a patch that clearly shows that the miscompile that was present will be fixed with this patch.

jmorse mentioned this in rL331514: [X86] Add test case for PR30290s failing behaviour. May 4 2018, 3:08 AM

Rebase against now-committed test case. The patch against the test is indeed much clearer now, it's obvious what the effect of the code change is.

LGTM.

This revision is now accepted and ready to land. May 4 2018, 7:02 AM

Note: I marked this patch as 'accepted' on Phab over an hour ago, but I
haven't gotten an email about it, so sending 'LGTM' via email.

Closed by commit rL331749: [X86] Mark all byval parameters as aliased (authored by jmorse). May 8 2018, 2:21 AM

This revision was automatically updated to reflect the committed changes.

Diff 141811

lib/Target/X86/X86ISelLowering.cpp

X86TargetLowering::LowerMemArgument(SDValue Chain, CallingConv::ID CallConv,

   // FIXME: For now, all byval parameter objects are marked mutable. This can be
   // changed with more analysis.
   // In case of tail call optimization mark all arguments mutable. Since they
   // could be overwritten by lowering of arguments in case of a tail call.
   if (Flags.isByVal()) {
     unsigned Bytes = Flags.getByValSize();
     if (Bytes == 0) Bytes = 1; // Don't create zero-sized stack objects.
-    int FI = MFI.CreateFixedObject(Bytes, VA.getLocMemOffset(), isImmutable);
+    int FI = MFI.CreateFixedObject(Bytes, VA.getLocMemOffset(), isImmutable, true);
     // Adjust SP offset of interrupt parameter.
     if (CallConv == CallingConv::X86_INTR) {
       MFI.setObjectOffset(FI, Offset);
     }
     return DAG.getFrameIndex(FI, PtrVT);
   }

   // This is an argument in memory. We might be able to perform copy elision.

test/CodeGen/X86/pr30290.ll

This file was added.

				; RUN: lli -mcpu=btver2 %s | FileCheck %s
				; CHECK: 5
				; Test desc: two functions (foo, bar) with byval arguments, should not have
				; reads/writes from/to byval storage re-ordered.
				source_filename = "test.c"
				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-pc-linux-gnu"

				%struct.face = type { [7 x i32] }

				@.str = private unnamed_addr constant [4 x i8] c"%d\0A\00", align 1

				; Function Attrs: noinline nounwind uwtable
				define void @baz(%struct.face* byval nocapture readonly align 8) local_unnamed_addr #0 {
				%2 = getelementptr inbounds %struct.face, %struct.face* %0, i64 0, i32 0, i64 0
				%3 = load i32, i32* %2, align 8, !tbaa !2
				%4 = getelementptr inbounds %struct.face, %struct.face* %0, i64 0, i32 0, i64 1
				%5 = load i32, i32* %4, align 4, !tbaa !2
				%6 = getelementptr inbounds %struct.face, %struct.face* %0, i64 0, i32 0, i64 2
				%7 = load i32, i32* %6, align 8, !tbaa !2
				%8 = getelementptr inbounds %struct.face, %struct.face* %0, i64 0, i32 0, i64 3
				%9 = bitcast i32* %8 to <4 x i32>*
				%10 = load <4 x i32>, <4 x i32>* %9, align 4, !tbaa !2
				%11 = shufflevector <4 x i32> %10, <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 undef, i32 undef>
				%12 = add nsw <4 x i32> %10, %11
				%13 = shufflevector <4 x i32> %12, <4 x i32> undef, <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef>
				%14 = add nsw <4 x i32> %12, %13
				%15 = extractelement <4 x i32> %14, i32 0
				%16 = add nsw i32 %15, %7
				%17 = add nsw i32 %16, %5
				%18 = add nsw i32 %17, %3
				%19 = tail call i32 (i8*, ...) @printf(i8* getelementptr inbounds ([4 x i8], [4 x i8]* @.str, i64 0, i64 0), i32 %18)
				ret void
				}

				; Function Attrs: argmemonly nounwind
				declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture) #1

				; Function Attrs: argmemonly nounwind
				declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture) #1

				; Function Attrs: nounwind
				declare i32 @printf(i8* nocapture readonly, ...) local_unnamed_addr #2

				; Function Attrs: noinline nounwind uwtable
				define void @foo(%struct.face* byval nocapture align 8) local_unnamed_addr #0 {
				%2 = bitcast %struct.face* %0 to <4 x i32>*
				store <4 x i32> <i32 1, i32 1, i32 1, i32 1>, <4 x i32>* %2, align 8, !tbaa !2
				%3 = getelementptr inbounds %struct.face, %struct.face* %0, i64 0, i32 0, i64 4
				store i32 1, i32* %3, align 8, !tbaa !2
				; XXX XXX XXX
				; Fault happens here: five "1" constants have just been written into the byval
				; %struct.face, but the subsequent byval read of that struct (next call)
				; gets re-ordered with those writes, illegally.
				call void @baz(%struct.face* byval nonnull align 8 %0)
				ret void
				}

				; Function Attrs: noinline nounwind uwtable
				define i32 @main() local_unnamed_addr #0 {
				%1 = alloca %struct.face, align 8
				%2 = bitcast %struct.face* %1 to i8*
				call void @llvm.lifetime.start.p0i8(i64 28, i8* nonnull %2) #3
				call void @llvm.memset.p0i8.i64(i8* nonnull %2, i8 0, i64 28, i32 8, i1 false)
				call void @foo(%struct.face* byval nonnull align 8 %1)
				call void @llvm.lifetime.end.p0i8(i64 28, i8* nonnull %2) #3
				ret i32 0
				}

				; Function Attrs: argmemonly nounwind
				declare void @llvm.memset.p0i8.i64(i8* nocapture writeonly, i8, i64, i32, i1) #1

				attributes #0 = { noinline nounwind uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="false" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="btver2" "target-features"="+aes,+avx,+bmi,+cx16,+f16c,+fxsr,+lzcnt,+mmx,+movbe,+pclmul,+popcnt,+prfchw,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+sse4a,+ssse3,+x87,+xsave,+xsaveopt" "unsafe-fp-math"="false" "use-soft-float"="false" }
				attributes #1 = { argmemonly nounwind }
				attributes #2 = { nounwind "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="false" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="btver2" "target-features"="+aes,+avx,+bmi,+cx16,+f16c,+fxsr,+lzcnt,+mmx,+movbe,+pclmul,+popcnt,+prfchw,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+sse4a,+ssse3,+x87,+xsave,+xsaveopt" "unsafe-fp-math"="false" "use-soft-float"="false" }
				attributes #3 = { nounwind }

				!llvm.module.flags = !{!0}
				!llvm.ident = !{!1}

				!0 = !{i32 1, !"wchar_size", i32 4}
				!1 = !{!"clang version 6.0.0-svn326550-1~exp1~20180404173613.65 (branches/release_60)"}
				!2 = !{!3, !3, i64 0}
				!3 = !{!"int", !4, i64 0}
				!4 = !{!"omnipotent char", !5, i64 0}
				!5 = !{!"Simple C/C++ TBAA"}

This is an archive of the discontinued LLVM Phabricator instance.

[X86] Mark all byval parameters as aliased
ClosedPublic
