This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Instrumentation/
-
Transforms/
-
Instrumentation/
-
MemorySanitizer.cpp
-
test/Instrumentation/MemorySanitizer/
-
Instrumentation/
-
MemorySanitizer/
-
opaque-ptr.ll

Differential D123602

[MSan] Ensure argument shadow initialized on memcpy
ClosedPublic

Authored by nikic on Apr 12 2022, 6:19 AM.

Download Raw Diff

Details

Reviewers

vitalybuka
eugenis

Commits

rG0adadfa68f82: [MSan] Ensure argument shadow initialized on memcpy
rG163a9f4552be: [MSan] Ensure argument shadow initialized on memcpy

Summary

We need to explicitly query the shadow here, because it is lazily initialized for arguments. Without opaque pointers this used to mostly work out, because there would be a bitcast to i8* present, and that would query the argument shadow.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nikic created this revision.Apr 12 2022, 6:19 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 12 2022, 6:19 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

nikic requested review of this revision.Apr 12 2022, 6:19 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 12 2022, 6:19 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

nikic mentioned this in D123300: [Clang] Enable opaque pointers by default.Apr 12 2022, 6:19 AM

Harbormaster completed remote builds in B159224: Diff 422208.Apr 12 2022, 7:09 AM

I am not sure how this can help, I will try later today.
As I see it it just calculates the shadow address and these instructions will be removed as unused?

In D123602#3445885, @vitalybuka wrote:

I am not sure how this can help, I will try later today.
As I see it it just calculates the shadow address and these instructions will be removed as unused?

Shadow memory for arguments is lazily initialized. If we never call getShadow() for an argument, then we'll leave it uninitialized. In most cases this happens naturally, but for memcpy/memmove we call a builtin that will access the shadow internally, so we never perform the getShadow() call and never initialize the shadow.

In D123602#3445899, @nikic wrote:

In D123602#3445885, @vitalybuka wrote:

I am not sure how this can help, I will try later today.
As I see it it just calculates the shadow address and these instructions will be removed as unused?

Shadow memory for arguments is lazily initialized. If we never call getShadow() for an argument, then we'll leave it uninitialized. In most cases this happens naturally, but for memcpy/memmove we call a builtin that will access the shadow internally, so we never perform the getShadow() call and never initialize the shadow.

I see. I suspect it does not work if instead of llvm.memcpy we have "call void @foo(i8* %p)" and "call void @foo_noundef(i8* noundef %p)"?

Most likely it would, as visitCallBase has it on all code paths, but could you add this into the test?

vitalybuka accepted this revision.Apr 12 2022, 9:52 AM

This revision is now accepted and ready to land.Apr 12 2022, 9:52 AM

Could you clarify that the problem is specifically with byval argument in the patch description?

Also, I'm wondering if we should make this copy explicit, perhaps by checking if the argument has any uses in the function body. I'm not 100% convinced that this is the only place where we forget to copy the shadow.

This change LGTM as a quick fix.

This revision was landed with ongoing or failed builds.Apr 12 2022, 2:50 PM

Closed by commit rG163a9f4552be: [MSan] Ensure argument shadow initialized on memcpy (authored by vitalybuka). · Explain Why

This revision was automatically updated to reflect the committed changes.

vitalybuka added a commit: rG163a9f4552be: [MSan] Ensure argument shadow initialized on memcpy.

vitalybuka added a commit: rG0adadfa68f82: [MSan] Ensure argument shadow initialized on memcpy.Apr 12 2022, 2:53 PM

vitalybuka added a reverting change: rGefdc90baaaeb: Revert "[MSan] Ensure argument shadow initialized on memcpy".

Works locally, so I landed it to fix the bot.
I will address my and @eugenis suggestions separately.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Instrumentation/

MemorySanitizer.cpp

2 lines

test/

Instrumentation/

MemorySanitizer/

opaque-ptr.ll

35 lines

Diff 422340

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp

Show First 20 Lines • Show All 2,564 Lines • ▼ Show 20 Lines	#endif
/// If we don't instrument it and it gets inlined,		/// If we don't instrument it and it gets inlined,
/// our interceptor will not kick in and we will lose the memmove.		/// our interceptor will not kick in and we will lose the memmove.
/// If we instrument the call here, but it does not get inlined,		/// If we instrument the call here, but it does not get inlined,
/// we will memove the shadow twice: which is bad in case		/// we will memove the shadow twice: which is bad in case
/// of overlapping regions. So, we simply lower the intrinsic to a call.		/// of overlapping regions. So, we simply lower the intrinsic to a call.
///		///
/// Similar situation exists for memcpy and memset.		/// Similar situation exists for memcpy and memset.
void visitMemMoveInst(MemMoveInst &I) {		void visitMemMoveInst(MemMoveInst &I) {
		getShadow(I.getArgOperand(1)); // Ensure shadow initialized
IRBuilder<> IRB(&I);		IRBuilder<> IRB(&I);
IRB.CreateCall(		IRB.CreateCall(
MS.MemmoveFn,		MS.MemmoveFn,
{IRB.CreatePointerCast(I.getArgOperand(0), IRB.getInt8PtrTy()),		{IRB.CreatePointerCast(I.getArgOperand(0), IRB.getInt8PtrTy()),
IRB.CreatePointerCast(I.getArgOperand(1), IRB.getInt8PtrTy()),		IRB.CreatePointerCast(I.getArgOperand(1), IRB.getInt8PtrTy()),
IRB.CreateIntCast(I.getArgOperand(2), MS.IntptrTy, false)});		IRB.CreateIntCast(I.getArgOperand(2), MS.IntptrTy, false)});
I.eraseFromParent();		I.eraseFromParent();
}		}

// Similar to memmove: avoid copying shadow twice.		// Similar to memmove: avoid copying shadow twice.
// This is somewhat unfortunate as it may slowdown small constant memcpys.		// This is somewhat unfortunate as it may slowdown small constant memcpys.
// FIXME: consider doing manual inline for small constant sizes and proper		// FIXME: consider doing manual inline for small constant sizes and proper
// alignment.		// alignment.
void visitMemCpyInst(MemCpyInst &I) {		void visitMemCpyInst(MemCpyInst &I) {
		getShadow(I.getArgOperand(1)); // Ensure shadow initialized
IRBuilder<> IRB(&I);		IRBuilder<> IRB(&I);
IRB.CreateCall(		IRB.CreateCall(
MS.MemcpyFn,		MS.MemcpyFn,
{IRB.CreatePointerCast(I.getArgOperand(0), IRB.getInt8PtrTy()),		{IRB.CreatePointerCast(I.getArgOperand(0), IRB.getInt8PtrTy()),
IRB.CreatePointerCast(I.getArgOperand(1), IRB.getInt8PtrTy()),		IRB.CreatePointerCast(I.getArgOperand(1), IRB.getInt8PtrTy()),
IRB.CreateIntCast(I.getArgOperand(2), MS.IntptrTy, false)});		IRB.CreateIntCast(I.getArgOperand(2), MS.IntptrTy, false)});
I.eraseFromParent();		I.eraseFromParent();
}		}
▲ Show 20 Lines • Show All 2,796 Lines • Show Last 20 Lines

llvm/test/Instrumentation/MemorySanitizer/opaque-ptr.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt -S -passes=msan < %s \| FileCheck %s

				target triple = "x86_64-unknown-linux-gnu"

				define void @test_memcpy(ptr %p, ptr byval(i32) %p2) sanitize_memory {
				; CHECK-LABEL: @test_memcpy(
				; CHECK-NEXT: [[TMP1:%.]] = ptrtoint ptr [[P2:%.]] to i64
				; CHECK-NEXT: [[TMP2:%.*]] = xor i64 [[TMP1]], 87960930222080
				; CHECK-NEXT: [[TMP3:%.*]] = inttoptr i64 [[TMP2]] to ptr
				; CHECK-NEXT: call void @llvm.memcpy.p0.p0.i64(ptr align 4 [[TMP3]], ptr align 4 inttoptr (i64 add (i64 ptrtoint (ptr @__msan_param_tls to i64), i64 8) to ptr), i64 4, i1 false)
				; CHECK-NEXT: call void @llvm.donothing()
				; CHECK-NEXT: [[TMP4:%.]] = call ptr @__msan_memcpy(ptr [[P:%.]], ptr [[P2]], i64 4)
				; CHECK-NEXT: ret void
				;
				call void @llvm.memcpy.p0.p0.i64(i8* %p, i8* %p2, i64 4, i1 false)
				ret void
				}

				define void @test_memmove(ptr %p, ptr byval(i32) %p2) sanitize_memory {
				; CHECK-LABEL: @test_memmove(
				; CHECK-NEXT: [[TMP1:%.]] = ptrtoint ptr [[P2:%.]] to i64
				; CHECK-NEXT: [[TMP2:%.*]] = xor i64 [[TMP1]], 87960930222080
				; CHECK-NEXT: [[TMP3:%.*]] = inttoptr i64 [[TMP2]] to ptr
				; CHECK-NEXT: call void @llvm.memcpy.p0.p0.i64(ptr align 4 [[TMP3]], ptr align 4 inttoptr (i64 add (i64 ptrtoint (ptr @__msan_param_tls to i64), i64 8) to ptr), i64 4, i1 false)
				; CHECK-NEXT: call void @llvm.donothing()
				; CHECK-NEXT: [[TMP4:%.]] = call ptr @__msan_memmove(ptr [[P:%.]], ptr [[P2]], i64 4)
				; CHECK-NEXT: ret void
				;
				call void @llvm.memmove.p0.p0.i64(i8* %p, i8* %p2, i64 4, i1 false)
				ret void
				}

				declare void @llvm.memcpy.p0.p0.i64(i8, i8, i64, i1)
				declare void @llvm.memmove.p0.p0.i64(i8, i8, i64, i1)