This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Transforms/Utils/
-
Transforms/
-
Utils/
2/4
SimplifyLibCalls.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
1/1
snprintf-memccpy.ll

Differential D67986

[InstCombine] snprintf (d, size, "%s", s) -> memccpy (d, s, '\0', size - 1), d[size - 1] = 0
AbandonedPublic

Authored by xbolva00 on Sep 24 2019, 2:25 PM.

Download Raw Diff

Details

Reviewers

efriedma
jdoerfert
nickdesaulniers

Summary

snprintf (d, size, "%s", s)
->
memccpy (d, s, '\0', size - 1),
d[size - 1] = 0

memccpy is much faster than snprintf my microbenchmark

time ./snprintf.out 1000000

real 0m0,057s
user 0m0,057s
sys 0m0,000s

time ./memccpy.out 1000000

real 0m0,021s
user 0m0,021s
sys 0m0,000s

Diff Detail

Event Timeline

xbolva00 created this revision.Sep 24 2019, 2:25 PM

Herald added a project: Restricted Project. · View Herald TranscriptSep 24 2019, 2:25 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Do we have accurate TLI information for memccpy? Not sure exactly how widely available it is.

If we're going to start generating memccpy calls, do we need to implement optimizations for it? For example, memccpy of a constant string can be transformed to memcpy.

There is a comment in TLI:

     // Win32 does not support these functions, but
     // they are generally available on POSIX-compliant systems.
...
         TLI.setUnavailable(LibFunc_memccpy);

If we're going to start generating memccpy calls, do we need to implement optimizations for it? For example, memccpy of a constant string can be transformed to memcpy.

Yeah, as follow up patch.
snprintf (d, size, "%s", "constant") is handled currently, so we should not regress the current "snprintf" calls with this transform..

In D67986#1681631, @efriedma wrote:

If we're going to start generating memccpy calls, do we need to implement optimizations for it? For example, memccpy of a constant string can be transformed to memcpy.

Ok, done: https://reviews.llvm.org/D68089

Wrote a simple microbenchmark - memccpy is 4-5x faster than sprintf in this case.

Oh, I think the proposed transformation in that paper is incorrect.

It should rather be:
memccpy(d,s,0, n-1)
d[n-1] = 0

Since "A terminating null character is automatically appended after the content written." (snprintf)

Yes, that's right. Thanks for spotting that.

Updated. Current tranformations should be correct now.

Rebased + added one new test

xbolva00 edited the summary of this revision. (Show Details)Oct 2 2019, 8:45 AM

xbolva00 added a reviewer: nickdesaulniers.

xbolva00 edited the summary of this revision. (Show Details)

lebedev.ri added a subscriber: lebedev.ri.Oct 2 2019, 9:49 AM

lebedev.ri added inline comments.

lib/Transforms/Utils/SimplifyLibCalls.cpp
2581	Where did we ask TLI about the existence of `memccpy`?
2583	It likely should be `CreateInBoundsGEP()`.

xbolva00 marked 2 inline comments as done.Oct 2 2019, 9:56 AM

xbolva00 added inline comments.

lib/Transforms/Utils/SimplifyLibCalls.cpp
2581	emitMemCCpy calls emitLibcall which checks it - otherwise returns nullptr. Ah, and we should not emit store in that case, if emitmemccpy failed. Thanks!
2583	I was wondering about this too, then code above is broken too :) I will fix it.

Addressed review comments

xbolva00 marked an inline comment as done.Oct 2 2019, 10:09 AM

xbolva00 added inline comments.

test/Transforms/InstCombine/snprintf-memccpy.ll
47	hmm... Should I call DecSize->eraseFromParent() ?

It sounds like memccpy is part of C20. Can we not do this transform unless LangOpt says we're C20 or greater? Otherwise the Linux kernel doesn't implement this routine.

In D67986#1691643, @nickdesaulniers wrote:

It sounds like memccpy is part of C20. Can we not do this transform unless LangOpt says we're C20 or greater? Otherwise the Linux kernel doesn't implement this routine.

LLVM middle-end does not and should not care what (if any) language standard the frontend was targeting when producing the initial IR.
Only the presence (as per TargetLoweringInfo::has()) of the replacement builtin matters.

The kernel should use same workaround as for bcmp transformation in this case.

Otherwise the Linux kernel doesn't implement this routine.

Original paper is from a GCC dev, so I think GCC will implement memccpy like optimizations for GCC 10 too.

Can the kernel just implement it this easy way? memccpy is not very known but I believe there are places in the kernel where memccpy could be used efficiently by kernel devs.

https://code.woboq.org/userspace/glibc/string/memccpy.c.html

It sounds like memccpy is part of C20. Can we not do this transform unless LangOpt says we're C20 or greater? Otherwise the Linux kernel doesn't implement this routine.

Doesn't the kernel use -fno-builtin? That should be enough to suppress this optimization.

In general, we assume that TargetLibraryInfo is providing an accurate picture of what functions are available. If it isn't, we should fix that in some general way.

but -fno-builtin is maybe too aggressive? somebody should benchmark kernel with/without it.

Anyway, I thought -fno-builtin-memccpy also could disable it, but no.. -fno-builtin-snprintf / -ffreestanding works.

Oh, hmm... actually, just realized something; is the new formulation actually equivalent? Specifically, is it okay to write to "d[size - 1]" if the string is short? Say, for example, you have something like snprintf(buf, 10, "%s", "x"). That normally writes to two elements of the array: 0 and 1. The rewritten version writes to three elements: 0, 1, and 9.

-fno-builtin-memccpy doesn't work because memccpy isn't listed in include/clang/Basic/Builtins.def. We should probably fix that.

In D67986#1692020, @efriedma wrote:

Oh, hmm... actually, just realized something; is the new formulation actually equivalent? Specifically, is it okay to write to "d[size - 1]" if the string is short? Say, for example, you have something like snprintf(buf, 10, "%s", "x"). That normally writes to two elements of the array: 0 and 1. The rewritten version writes to three elements: 0, 1, and 9.

Sounds like this should be guarded to only work for simple (non-volatile, non-atomic) pointers?

-fno-builtin-memccpy doesn't work because memccpy isn't listed in include/clang/Basic/Builtins.def. We should probably fix that.

Just add new record:
LIBBUILTIN(memccpy, "v*v*vC*iz", "f", "string.h", ALL_LANGUAGES)

Than great, then linux kernel would just use -fno-builtin-memccpy and we are fine here.

Sounds like this should be guarded to only work for simple (non-volatile, non-atomic) pointers?

Good catch, yeah, I will do it. 'Dst' must be simple ptr.

Say, for example, you have something like snprintf(buf, 10, "%s", "x"). That normally writes to two elements of the array: 0 and 1. The rewritten version writes to three elements: 0, 1, and 9.

Hm, this is right. We overwrite d[size -1] always -> bad. I have no more ideas, probably we can do nothing, just abandon this transformation.

You could do something like if (!memccpy (d, s, '\0', size - 1)) d[size - 1] = 0;, I guess. That's a few more instructions that I'd really like, but I don't see a better alternative.

In D67986#1692126, @efriedma wrote:

You could do something like if (!memccpy (d, s, '\0', size - 1)) d[size - 1] = 0;, I guess. That's a few more instructions that I'd really like, but I don't see a better alternative.

Yeah, but probably InstCombine is not best place to create such ranches/phis, select would be better.

Maybe this could work?
char *ret = memccpy (d, s, '\0', n - 1);
char * ptr = ret ? ret - 1 : &d[n - 1];
*ptr = 0;

Missclicked, sorry.

And yes, with D68377 -fno-builtin-memccpy works -> Ideal fix for the linux kernel.

I'd make the argument that it should be strlcpy if present. That covers a lot of existing systems too. I'm not sure this optimisation should be done for freestanding mode though.

I'm not sure this optimisation should be done for freestanding mode though.

I said it above, -ffreestanding disables this transformation too.

I'd make the argument that it should be strlcpy if present. That covers a lot of existing systems too.

strlcpy is good idea and it is faster than memccpy. but.. I need to link with -lbsd on Ubuntu.

I dont use BSD systems, so I am not gonna to do your suggested transformation since I cant test it. But patches are welcome, I will be happy to look at it if you send one here.

In D67986#1687323, @xbolva00 wrote:

Oh, I think the proposed transformation in that paper is incorrect.

It should rather be:
memccpy(d,s,0, n-1)
d[n-1] = 0

Since "A terminating null character is automatically appended after the content written." (snprintf)

@efriedma, what do you think about this sequence?

The sequence from https://reviews.llvm.org/D67986#1692153 works, I guess.

This transformation seems to increase code size significantly. Is the snprintf "%s" pattern common enough? I suspect most projects have already used memccpy, stpncpy, strscpy, or strlcpy. For the few that don't, the performance probably does not matter.

In D67986#1702901, @MaskRay wrote:

This transformation seems to increase code size significantly. Is the snprintf "%s" pattern common enough? I suspect most projects have already used memccpy, stpncpy, strscpy, or strlcpy. For the few that don't, the performance probably does not matter.

Yes, quite common. But okay, if you dont want it, let's just abandon it.

xbolva00 mentioned this in D68377: [Builtins] Teach Clang about memccpy.Oct 9 2019, 11:08 PM

In D67986#1703031, @xbolva00 wrote:

In D67986#1702901, @MaskRay wrote:

This transformation seems to increase code size significantly. Is the snprintf "%s" pattern common enough? I suspect most projects have already used memccpy, stpncpy, strscpy, or strlcpy. For the few that don't, the performance probably does not matter.

Yes, quite common. But okay, if you dont want it, let's just abandon it.

I wouldn't have quit on this so easily, but OK.

BTW, nowdays we transform snprintf(d, s, "%s" , ...) into two calls - memcpy + strlen - and nobody is concerned about code size increase anyway.

So I dont think code size is problem here (if so, !optForSize). Various InstCombine transformations produce two calls. (here it is just call + select..).

I will continue with this patch.

In D67986#1702901, @MaskRay wrote:

This transformation seems to increase code size significantly. Is the snprintf "%s" pattern common enough? I suspect most projects have already used memccpy, stpncpy, strscpy, or strlcpy. For the few that don't, the performance probably does not matter.

Sounds like then maybe this optimization should conditionally occur based on optimization level/goals? For instance, maybe it's not appropriate at -Os, but is at -O2?

I am fine with your suggestion to restrict it like you said.

(Generally, I think more transforms in instcombine should be restricted this way)

What is the status for this one?

I will update this soon.

In D67986#1702901, @MaskRay wrote:

This transformation seems to increase code size significantly. Is the snprintf "%s" pattern common enough? I suspect most projects have already used memccpy, stpncpy, strscpy, or strlcpy. For the few that don't, the performance probably does not matter.

I think this is right, since correct expansion via memccpy is quite big even for -O3.

Revision Contents

Path

Size

lib/

Transforms/

Utils/

SimplifyLibCalls.cpp

44 lines

test/

Transforms/

InstCombine/

snprintf-memccpy.ll

83 lines

Diff 222857

lib/Transforms/Utils/SimplifyLibCalls.cpp

Show First 20 Lines • Show All 2,442 Lines • ▼ Show 20 Lines	Value LibCallSimplifier::optimizeSPrintFString(CallInst CI, IRBuilder<> &B) {
// Decode the second character of the format string.		// Decode the second character of the format string.
if (FormatStr[1] == 'c') {		if (FormatStr[1] == 'c') {
// sprintf(dst, "%c", chr) --> (i8)dst = chr; ((i8)dst+1) = 0		// sprintf(dst, "%c", chr) --> (i8)dst = chr; ((i8)dst+1) = 0
if (!CI->getArgOperand(2)->getType()->isIntegerTy())		if (!CI->getArgOperand(2)->getType()->isIntegerTy())
return nullptr;		return nullptr;
Value *V = B.CreateTrunc(CI->getArgOperand(2), B.getInt8Ty(), "char");		Value *V = B.CreateTrunc(CI->getArgOperand(2), B.getInt8Ty(), "char");
Value *Ptr = castToCStr(CI->getArgOperand(0), B);		Value *Ptr = castToCStr(CI->getArgOperand(0), B);
B.CreateStore(V, Ptr);		B.CreateStore(V, Ptr);
Ptr = B.CreateGEP(B.getInt8Ty(), Ptr, B.getInt32(1), "nul");		Ptr = B.CreateInBoundsGEP(B.getInt8Ty(), Ptr, B.getInt32(1), "nul");
B.CreateStore(B.getInt8(0), Ptr);		B.CreateStore(B.getInt8(0), Ptr);

return ConstantInt::get(CI->getType(), 1);		return ConstantInt::get(CI->getType(), 1);
}		}

if (FormatStr[1] == 's') {		if (FormatStr[1] == 's') {
// sprintf(dest, "%s", str) -> llvm.memcpy(align 1 dest, align 1 str,		// sprintf(dest, "%s", str) -> llvm.memcpy(align 1 dest, align 1 str,
// strlen(str)+1)		// strlen(str)+1)
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	if (TLI->has(LibFunc_small_sprintf) && !callHasFP128Argument(CI)) {
return New;		return New;
}		}

annotateNonNullBasedOnAccess(CI, {0, 1});		annotateNonNullBasedOnAccess(CI, {0, 1});
return nullptr;		return nullptr;
}		}

Value LibCallSimplifier::optimizeSnPrintFString(CallInst CI, IRBuilder<> &B) {		Value LibCallSimplifier::optimizeSnPrintFString(CallInst CI, IRBuilder<> &B) {
// Check for size		Value *Dst = CI->getArgOperand(0);
ConstantInt *Size = dyn_cast<ConstantInt>(CI->getArgOperand(1));		Value *SizeArg = CI->getArgOperand(1);
if (!Size)		ConstantInt *Size = dyn_cast<ConstantInt>(SizeArg);
return nullptr;

uint64_t N = Size->getZExtValue();
// Check for a fixed format string.		// Check for a fixed format string.
StringRef FormatStr;		StringRef FormatStr;
if (!getConstantStringInfo(CI->getArgOperand(2), FormatStr))		if (!getConstantStringInfo(CI->getArgOperand(2), FormatStr))
return nullptr;		return nullptr;

// If we just have a format string (nothing else crazy) transform it.		// If we just have a format string (nothing else crazy) transform it.
if (CI->getNumArgOperands() == 3) {		if (CI->getNumArgOperands() == 3) {
// Make sure there's no % in the constant array. We could try to handle		// Make sure there's no % in the constant array. We could try to handle
// %% -> % in the future if we cared.		// %% -> % in the future if we cared.
if (FormatStr.find('%') != StringRef::npos)		if (FormatStr.find('%') != StringRef::npos)
return nullptr; // we found a format specifier, bail out.		return nullptr; // we found a format specifier, bail out.

		if (!Size)
		return nullptr;
		uint64_t N = Size->getZExtValue();
if (N == 0)		if (N == 0)
return ConstantInt::get(CI->getType(), FormatStr.size());		return ConstantInt::get(CI->getType(), FormatStr.size());
else if (N < FormatStr.size() + 1)		else if (N < FormatStr.size() + 1)
return nullptr;		return nullptr;

// snprintf(dst, size, fmt) -> llvm.memcpy(align 1 dst, align 1 fmt,		// snprintf(dst, size, fmt) -> llvm.memcpy(align 1 dst, align 1 fmt,
// strlen(fmt)+1)		// strlen(fmt)+1)
B.CreateMemCpy(		B.CreateMemCpy(
CI->getArgOperand(0), 1, CI->getArgOperand(2), 1,		Dst, 1, CI->getArgOperand(2), 1,
ConstantInt::get(DL.getIntPtrType(CI->getContext()),		ConstantInt::get(DL.getIntPtrType(CI->getContext()),
FormatStr.size() + 1)); // Copy the null byte.		FormatStr.size() + 1)); // Copy the null byte.
return ConstantInt::get(CI->getType(), FormatStr.size());		return ConstantInt::get(CI->getType(), FormatStr.size());
}		}

// The remaining optimizations require the format string to be "%s" or "%c"		// The remaining optimizations require the format string to be "%s" or "%c"
// and have an extra operand.		// and have an extra operand.
if (FormatStr.size() == 2 && FormatStr[0] == '%' &&		if (FormatStr.size() == 2 && FormatStr[0] == '%' &&
CI->getNumArgOperands() == 4) {		CI->getNumArgOperands() == 4) {

// Decode the second character of the format string.		// Decode the second character of the format string.
if (FormatStr[1] == 'c') {		if (FormatStr[1] == 'c') {
		if (!Size)
		return nullptr;
		uint64_t N = Size->getZExtValue();
if (N == 0)		if (N == 0)
return ConstantInt::get(CI->getType(), 1);		return ConstantInt::get(CI->getType(), 1);
else if (N == 1)		else if (N == 1)
return nullptr;		return nullptr;

// snprintf(dst, size, "%c", chr) --> (i8)dst = chr; ((i8)dst+1) = 0		// snprintf(dst, size, "%c", chr) --> (i8)dst = chr; ((i8)dst+1) = 0
if (!CI->getArgOperand(3)->getType()->isIntegerTy())		if (!CI->getArgOperand(3)->getType()->isIntegerTy())
return nullptr;		return nullptr;
Value *V = B.CreateTrunc(CI->getArgOperand(3), B.getInt8Ty(), "char");		Value *V = B.CreateTrunc(CI->getArgOperand(3), B.getInt8Ty(), "char");
Value *Ptr = castToCStr(CI->getArgOperand(0), B);		Value *Ptr = castToCStr(Dst, B);
B.CreateStore(V, Ptr);		B.CreateStore(V, Ptr);
Ptr = B.CreateGEP(B.getInt8Ty(), Ptr, B.getInt32(1), "nul");		Ptr = B.CreateInBoundsGEP(B.getInt8Ty(), Ptr, B.getInt32(1), "nul");
B.CreateStore(B.getInt8(0), Ptr);		B.CreateStore(B.getInt8(0), Ptr);

return ConstantInt::get(CI->getType(), 1);		return ConstantInt::get(CI->getType(), 1);
}		}

if (FormatStr[1] == 's') {		if (FormatStr[1] == 's') {
// snprintf(dest, size, "%s", str) to llvm.memcpy(dest, str, len+1, 1)		// snprintf(dest, size, "%s", str) to llvm.memcpy(dest, str, len+1, 1)
StringRef Str;		StringRef Str;
if (!getConstantStringInfo(CI->getArgOperand(3), Str))		if (!getConstantStringInfo(CI->getArgOperand(3), Str)) {
		if (CI->use_empty() && isKnownNonZero(SizeArg, DL)) {
		// snprintf (d, size, "%s", s) -> memccpy (d, s, '\0', size - 1),
		// d[size - 1] = 0
		Value *DecSize =
		B.CreateSub(SizeArg, ConstantInt::get(SizeArg->getType(), 1));
		Value *V = emitMemCCpy(Dst, CI->getArgOperand(3), B.getInt32('\0'), DecSize, B,
		lebedev.riUnsubmitted Not Done Reply Inline Actions Where did we ask TLI about the existence of `memccpy`? lebedev.ri: Where did we ask TLI about the existence of `memccpy`?
		xbolva00AuthorUnsubmitted Done Reply Inline Actions emitMemCCpy calls emitLibcall which checks it - otherwise returns nullptr. Ah, and we should not emit store in that case, if emitmemccpy failed. Thanks! xbolva00: emitMemCCpy calls emitLibcall which checks it - otherwise returns nullptr. Ah, and we should…
		TLI);
		if (V) {
		lebedev.riUnsubmitted Not Done Reply Inline Actions It likely should be `CreateInBoundsGEP()`. lebedev.ri: It likely should be `CreateInBoundsGEP()`.
		xbolva00AuthorUnsubmitted Done Reply Inline Actions I was wondering about this too, then code above is broken too :) I will fix it. xbolva00: I was wondering about this too, then code above is broken too :) I will fix it.
		Value *DstEnd = B.CreateInBoundsGEP(B.getInt8Ty(), Dst, DecSize);
		B.CreateStore(B.getInt8(0), DstEnd);
		return Dst;
		}
		}
return nullptr;		return nullptr;
		}

		if (!Size)
		return nullptr;
		uint64_t N = Size->getZExtValue();
if (N == 0)		if (N == 0)
return ConstantInt::get(CI->getType(), Str.size());		return ConstantInt::get(CI->getType(), Str.size());
else if (N < Str.size() + 1)		else if (N < Str.size() + 1)
return nullptr;		return nullptr;

B.CreateMemCpy(CI->getArgOperand(0), 1, CI->getArgOperand(3), 1,		B.CreateMemCpy(Dst, 1, CI->getArgOperand(3), 1,
ConstantInt::get(CI->getType(), Str.size() + 1));		ConstantInt::get(CI->getType(), Str.size() + 1));

// The snprintf result is the unincremented number of bytes in the string.		// The snprintf result is the unincremented number of bytes in the string.
return ConstantInt::get(CI->getType(), Str.size());		return ConstantInt::get(CI->getType(), Str.size());
}		}
}		}
return nullptr;		return nullptr;
}		}
▲ Show 20 Lines • Show All 879 Lines • Show Last 20 Lines

test/Transforms/InstCombine/snprintf-memccpy.ll

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt < %s -instcombine -S \| FileCheck %s --check-prefixes=CHECK,CHECK-MEMCCPY
				; RUN: opt < %s -instcombine -S -mtriple=x86_64-pc-windows-msvc \| FileCheck %s --check-prefixes=CHECK,CHECK-NO-MEMCCPY

				@.str = private constant [3 x i8] c"%s\00", align 1
				declare i32 @snprintf(i8, i64, i8, ...)

				define void @test_string_to_buf_retval_nonzero_n(i8* %buf, i8* %str) {
				; CHECK-MEMCCPY-LABEL: @test_string_to_buf_retval_nonzero_n(
				; CHECK-MEMCCPY-NEXT: [[MEMCCPY:%.]] = call i8 @memccpy(i8* [[BUF:%.]], i8 [[STR:%.*]], i32 0, i64 7)
				; CHECK-MEMCCPY-NEXT: [[TMP1:%.]] = getelementptr inbounds i8, i8 [[BUF]], i64 7
				; CHECK-MEMCCPY-NEXT: store i8 0, i8* [[TMP1]], align 1
				; CHECK-MEMCCPY-NEXT: ret void
				;
				; CHECK-NO-MEMCCPY-LABEL: @test_string_to_buf_retval_nonzero_n(
				; CHECK-NO-MEMCCPY-NEXT: [[CALL:%.]] = call i32 (i8, i64, i8, ...) @snprintf(i8 nonnull dereferenceable(1) [[BUF:%.]], i64 8, i8 getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* [[STR:%.*]])
				; CHECK-NO-MEMCCPY-NEXT: ret void
				;
				%call = call i32 (i8, i64, i8, ...) @snprintf(i8* %buf, i64 8, i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* %str)
				ret void
				}

				define void @test_string_to_buf_retval_n_one(i8* %buf, i8* %str) {
				; CHECK-MEMCCPY-LABEL: @test_string_to_buf_retval_n_one(
				; CHECK-MEMCCPY-NEXT: [[MEMCCPY:%.]] = call i8 @memccpy(i8* [[BUF:%.]], i8 [[STR:%.*]], i32 0, i64 0)
				; CHECK-MEMCCPY-NEXT: store i8 0, i8* [[BUF]], align 1
				; CHECK-MEMCCPY-NEXT: ret void
				;
				; CHECK-NO-MEMCCPY-LABEL: @test_string_to_buf_retval_n_one(
				; CHECK-NO-MEMCCPY-NEXT: [[CALL:%.]] = call i32 (i8, i64, i8, ...) @snprintf(i8 nonnull dereferenceable(1) [[BUF:%.]], i64 1, i8 getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* [[STR:%.*]])
				; CHECK-NO-MEMCCPY-NEXT: ret void
				;
				%call = call i32 (i8, i64, i8, ...) @snprintf(i8* %buf, i64 1, i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* %str)
				ret void
				}

				define void @test_string_to_buf_retval_known_nonzero_n(i8* %buf, i64 %n, i8* %str) {
				; CHECK-MEMCCPY-LABEL: @test_string_to_buf_retval_known_nonzero_n(
				; CHECK-MEMCCPY-NEXT: [[SIZE:%.]] = shl i64 3, [[N:%.]]
				; CHECK-MEMCCPY-NEXT: [[TMP1:%.*]] = add i64 [[SIZE]], -1
				; CHECK-MEMCCPY-NEXT: [[MEMCCPY:%.]] = call i8 @memccpy(i8* [[BUF:%.]], i8 [[STR:%.*]], i32 0, i64 [[TMP1]])
				; CHECK-MEMCCPY-NEXT: [[TMP2:%.]] = getelementptr inbounds i8, i8 [[BUF]], i64 [[TMP1]]
				; CHECK-MEMCCPY-NEXT: store i8 0, i8* [[TMP2]], align 1
				; CHECK-MEMCCPY-NEXT: ret void
				;
				; CHECK-NO-MEMCCPY-LABEL: @test_string_to_buf_retval_known_nonzero_n(
				; CHECK-NO-MEMCCPY-NEXT: [[SIZE:%.]] = shl i64 3, [[N:%.]]
				xbolva00AuthorUnsubmitted Done Reply Inline Actions hmm... Should I call DecSize->eraseFromParent() ? xbolva00: hmm... Should I call DecSize->eraseFromParent() ?
				; CHECK-NO-MEMCCPY-NEXT: [[TMP1:%.*]] = sub i64 [[SIZE]], 1
				; CHECK-NO-MEMCCPY-NEXT: [[CALL:%.]] = call i32 (i8, i64, i8, ...) @snprintf(i8 nonnull dereferenceable(1) [[BUF:%.]], i64 [[SIZE]], i8 getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* [[STR:%.*]])
				; CHECK-NO-MEMCCPY-NEXT: ret void
				;
				%size = shl i64 3, %n
				%call = call i32 (i8, i64, i8, ...) @snprintf(i8* %buf, i64 %size, i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* %str)
				ret void
				}

				; Negative tests
				define i32 @test_string_to_buf_retval_used_n_maybe_zero(i8* %buf, i64 %n, i8* %str) {
				; CHECK-LABEL: @test_string_to_buf_retval_used_n_maybe_zero(
				; CHECK-NEXT: [[CALL:%.]] = call i32 (i8, i64, i8, ...) @snprintf(i8 [[BUF:%.]], i64 [[N:%.]], i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* [[STR:%.*]])
				; CHECK-NEXT: ret i32 [[CALL]]
				;
				%call = call i32 (i8, i64, i8, ...) @snprintf(i8* %buf, i64 %n, i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* %str)
				ret i32 %call
				}

				define void @test_string_to_buf_retval_used_zero_n(i8* %buf, i8* %str) {
				; CHECK-LABEL: @test_string_to_buf_retval_used_zero_n(
				; CHECK-NEXT: [[CALL:%.]] = call i32 (i8, i64, i8, ...) @snprintf(i8 [[BUF:%.]], i64 0, i8 getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* [[STR:%.*]])
				; CHECK-NEXT: ret void
				;
				%call = call i32 (i8, i64, i8, ...) @snprintf(i8* %buf, i64 0, i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* %str)
				ret void
				}

				define void @test_string_to_buf_retval_unused(i8* %buf, i64 %n, i8* %str) {
				; CHECK-LABEL: @test_string_to_buf_retval_unused(
				; CHECK-NEXT: [[CALL:%.]] = call i32 (i8, i64, i8, ...) @snprintf(i8 [[BUF:%.]], i64 [[N:%.]], i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* [[STR:%.*]])
				; CHECK-NEXT: ret void
				;
				%call = call i32 (i8, i64, i8, ...) @snprintf(i8* %buf, i64 %n, i8* getelementptr inbounds ([3 x i8], [3 x i8]* @.str, i64 0, i64 0), i8* %str)
				ret void
				}

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] snprintf (d, size, "%s", s) -> memccpy (d, s, '\0', size - 1), d[size - 1] = 0AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 222857

lib/Transforms/Utils/SimplifyLibCalls.cpp

test/Transforms/InstCombine/snprintf-memccpy.ll

[InstCombine] snprintf (d, size, "%s", s) -> memccpy (d, s, '\0', size - 1), d[size - 1] = 0
AbandonedPublic