This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/CodeGen/
-
CodeGen/
1/3
CGExprScalar.cpp
-
test/CodeGen/
-
CodeGen/
-
compound-assign-overflow.c
1/5
ubsan-promoted-arith.cpp
-
unsigned-promotion.c

Differential D29369

[ubsan] Omit superflous overflow checks for promoted arithmetic (PR20193)
ClosedPublic

Authored by vsk on Jan 31 2017, 9:17 PM.

Download Raw Diff

Details

Reviewers

filcab
dtzWill

Commits

rG82ee16beb8fe: [ubsan] Omit superflous overflow checks for promoted arithmetic (PR20193)
rC296213: [ubsan] Omit superflous overflow checks for promoted arithmetic (PR20193)
rL296213: [ubsan] Omit superflous overflow checks for promoted arithmetic (PR20193)

Summary

C requires the operands of arithmetic expressions to be promoted if
their types are smaller than an int. Ubsan emits overflow checks when
this sort of type promotion occurs, even if there is no way to actually
get an overflow with the promoted type.

This patch teaches clang how to omit the superflous overflow checks
(addressing PR20193).

Testing: check-clang and check-ubsan.

Diff Detail

Event Timeline

vsk created this revision.Jan 31 2017, 9:17 PM

efriedma added a subscriber: efriedma.Feb 1 2017, 10:36 AM

efriedma added inline comments.

lib/CodeGen/CGExprScalar.cpp
73	Checking isPromotableIntegerType doesn't work the way you want it to; types can be "promoted" without actually widening them. For example, enum types are promotable, and in C++ wchar_t is promotable.

vsk marked an inline comment as done.Feb 1 2017, 3:28 PM

vsk added inline comments.

lib/CodeGen/CGExprScalar.cpp
73	Thanks for catching this! I have fixed the issue and will update this patch shortly.

Per Eli's comment: check that integers are actually widened, instead of incorrectly assuming they are always widened. I added some test cases for this.
Address the 'fixme' regarding multiplication with unsigned operands.

Remove a stale test case in unsigned-promotion.c.

Out of curiosity, how many of these superfluous checks are not subsequently eliminated by InstCombine?

In D29369#664366, @regehr wrote:

Out of curiosity, how many of these superfluous checks are not subsequently eliminated by InstCombine?

I don't have numbers from a benchmark prepped. Here's what we get with the 'ubsan-promoted-arith.cpp' test case from this patch:

Setup	# of overflow checks
unpatched, -O0	22
unpatched, -O0 + instcombine	7
patched, -O0	8
patched, -O0 + instcombine	7

(There's a difference between the "patched, -O0" setup and the "patched, -O0 + instcombine" setup because llvm figures out that the symbol 'a' is 0, and gets rid of an addition that way.)

At least for us, this patch is still worthwhile, because our use case is -O0 -fsanitized=undefined. Also, this makes less work for instcombine, but I haven't measured the compile-time effect.

Why the switch to if instead of a fully-covered switch/case?

In D29369#664426, @vsk wrote:

In D29369#664366, @regehr wrote:

Out of curiosity, how many of these superfluous checks are not subsequently eliminated by InstCombine?

I don't have numbers from a benchmark prepped. Here's what we get with the 'ubsan-promoted-arith.cpp' test case from this patch:

Setup # of overflow checks

unpatched, -O0 22

unpatched, -O0 + instcombine 7

patched, -O0 8

patched, -O0 + instcombine 7

(There's a difference between the "patched, -O0" setup and the "patched, -O0 + instcombine" setup because llvm figures out that the symbol 'a' is 0, and gets rid of an addition that way.)

At least for us, this patch is still worthwhile, because our use case is -O0 -fsanitized=undefined. Also, this makes less work for instcombine, but I haven't measured the compile-time effect.

Probably running mem2reg and others before instcombine would make it elide more checks. But if you're using -O0 anyway, I guess this would help anyway.

test/CodeGen/ubsan-promoted-arith.cpp
57	Nit: Maybe `USHRT_MAX * USHRT_MAX` is more understandable?
100	Maybe put the rdar ID next to the FIXME? Like this it looks like you might have written that instead of `CHECK` by mistake.

In D29369#665878, @filcab wrote:

Why the switch to if instead of a fully-covered switch/case?

It lets me avoid repeating two function calls:

switch (CGF.getLangOpts().getSignedOverflowBehavior()) {
case LangOptions::SOB_Defined:
  return Builder.CreateMul(Ops.LHS, Ops.RHS, "mul");
case LangOptions::SOB_Undefined:
  if (CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow) &&
      !CanElideOverflowCheck(CGF.getContext(), Ops))
    return EmitOverflowCheckedBinOp(Ops);
  return Builder.CreateNSWMul(Ops.LHS, Ops.RHS, "mul");
case LangOptions::SOB_Trapping:
  if (CanElideOverflowCheck(CGF.getContext(), Ops))
    return Builder.CreateNSWMul(Ops.LHS, Ops.RHS, "mul");
  return EmitOverflowCheckedBinOp(Ops);
}

test/CodeGen/ubsan-promoted-arith.cpp
57	Yes, I'll fix this.
100	I placed the rdar ID here to communicate that the line should become a CHECK line once the bug is fixed. It will go away with D29437.

In D29369#666308, @vsk wrote:

In D29369#665878, @filcab wrote:

Why the switch to if instead of a fully-covered switch/case?

It lets me avoid repeating two function calls:

Ah, sorry about that, I think I understand what you had in mind now. I'll fix that too.

Use switches per Filipe's comment, and fix a comment in the test case.

Minor nits, now.
LGTM, but having someone more familiar with clang chime in would be great.

lib/CodeGen/CGExprScalar.cpp
1723	Maybe a helper `IsWidenedIntegerOp(...)` (or `IsOpWiderThanBaseType`or something) would make this (and others, like the first return of `getUnwidenedIntegerType`) easier to read?
test/CodeGen/ubsan-promoted-arith.cpp
91	Extra `-`. `INT_MIN/-1` is what you want. You already have a test above for `-INT_MIN` (which would overflow before the division.

filcab accepted this revision.Feb 9 2017, 6:26 AM

This revision is now accepted and ready to land.Feb 9 2017, 6:26 AM

Paging @dtzWill

I've been bitten when attempting to use existence/nature of casts in the AST to reason about the original code, but this looks like it does the right thing in all the situations I can think of.

Missing overflows because of a bugged attempt to optimize the -O0 case would be unfortunate-- has this been tested and compared on larger codes (test-suite, other projects)?

When it comes to C/C++ standards and constructs, it seems there's always some extension/language feature/flag that someone (ab)uses that you forgot all about when reasoning about these things abstractly... so it'd be good to see this checked out before the next major release.

+1 to suggestion for a more readable name or wrapper for the common pattern of 'getUnwidenedIntegerType..hasValue()', if you get a chance.

LGTM, thanks!

After some thought, can we discuss why this is a good idea?

This increases the cyclomatic complexity of code that already is difficult to reason about, and seems like it's both brittle and out-of-place in CGExprScalar.

It really seems it would be better to let InstCombine or some other analysis/transform deal with proving checks redundant instead of attempting to do so on-the-fly during CodeGen.

Can you better motivate why this is worth these costs, or explain your use case a bit more?

This revision now requires changes to proceed.Feb 9 2017, 9:42 AM

In D29369#672166, @dtzWill wrote:

After some thought, can we discuss why this is a good idea?

The goal is to lower ubsan's compile-time + instrumentation overhead at -O0, since this reduces the friction of debugging a ubsan-instrumented project.

This increases the cyclomatic complexity of code that already is difficult to reason about, and seems like it's both brittle and out-of-place in CGExprScalar.

Are there cleanups or ways to reorganize the code that would make this sort of change less complex / brittle? I'm open to taking that on.

It really seems it would be better to let InstCombine or some other analysis/transform deal with proving checks redundant instead of attempting to do so on-the-fly during CodeGen.

-O1/-O2 do get rid of a lot of checks, but they also degrade the debugging experience, so it's not really a solution for this use case.

Can you better motivate why this is worth these costs, or explain your use case a bit more?

I have some numbers from LNT. I did a pre-patch and post-patch run at -O0 + -fsanitize=signed-integer-overflow,unsigned-integer-overflow. There were 4,672 object files produced in each run. This patch brings the average object size down from 36,472.0 to 36,378.3 bytes (a 0.26% improvement), and the average number of overflow checks per object down from 66.8 to 66.2 (a 0.81% improvement).

I don't have reliable compile-time numbers, but not emitting IR really seems like a straightforward improvement over emitting/analyzing/removing it.

So, those are the benefits. IMO getting close to 1% better at reducing instrumentation overhead is worth the complexity cost here.

In D29369#673064, @vsk wrote:

In D29369#672166, @dtzWill wrote:

After some thought, can we discuss why this is a good idea?

The goal is to lower ubsan's compile-time + instrumentation overhead at -O0, since this reduces the friction of debugging a ubsan-instrumented project.

Apologies for the delay, thank you for the explanation.

This increases the cyclomatic complexity of code that already is difficult to reason about, and seems like it's both brittle and out-of-place in CGExprScalar.

Are there cleanups or ways to reorganize the code that would make this sort of change less complex / brittle? I'm open to taking that on.

None that I see immediately (heh, otherwise I'd be working on them myself...) but the code paths for trapping/non-trapping are particularly what I meant re:complexity, and while I suppose the AST or its interface is probably unlikely to change much (?) I'm concerned about these checks silently removing checks they shouldn't in the future.

(Who would notice if this happened?)

It really seems it would be better to let InstCombine or some other analysis/transform deal with proving checks redundant instead of attempting to do so on-the-fly during CodeGen.

-O1/-O2 do get rid of a lot of checks, but they also degrade the debugging experience, so it's not really a solution for this use case.

Understood, that makes sense.

Would running InstCombine (and only InstCombine):

Fail to remove any checks elided by this change?
Have a negative impact on debugging experience? For this I'm mostly asking for a guess, I don't know how to exactly quantify this easily.

(3) Have an undesirable impact on compilation time or other negative consequence?)

Can you better motivate why this is worth these costs, or explain your use case a bit more?

I have some numbers from LNT. I did a pre-patch and post-patch run at -O0 + -fsanitize=signed-integer-overflow,unsigned-integer-overflow. There were 4,672 object files produced in each run. This patch brings the average object size down from 36,472.0 to 36,378.3 bytes (a 0.26% improvement), and the average number of overflow checks per object down from 66.8 to 66.2 (a 0.81% improvement).

Wonderful, thank you for producing and sharing these numbers. Those improvements don't convince me, but if you're saying this is important to you and your use-case/users I'm happy to go with that.

I don't have reliable compile-time numbers, but not emitting IR really seems like a straightforward improvement over emitting/analyzing/removing it.

Hard to say. Separation of concerns is important too, but of course there's trade-offs everywhere :). I'd suspect this doesn't change compile-time much either way.

So, those are the benefits. IMO getting close to 1% better at reducing instrumentation overhead is worth the complexity cost here.

Couldn't say, but that sounds reasonable to me and I don't mean to stand in the way of progress!

Can you answer my questions about InstCombine above? Thanks!

In D29369#676521, @dtzWill wrote:

In D29369#673064, @vsk wrote:

In D29369#672166, @dtzWill wrote:

After some thought, can we discuss why this is a good idea?

The goal is to lower ubsan's compile-time + instrumentation overhead at -O0, since this reduces the friction of debugging a ubsan-instrumented project.

Apologies for the delay, thank you for the explanation.

Np, thanks for taking a look!

This increases the cyclomatic complexity of code that already is difficult to reason about, and seems like it's both brittle and out-of-place in CGExprScalar.

Are there cleanups or ways to reorganize the code that would make this sort of change less complex / brittle? I'm open to taking that on.

None that I see immediately (heh, otherwise I'd be working on them myself...) but the code paths for trapping/non-trapping are particularly what I meant re:complexity, and while I suppose the AST or its interface is probably unlikely to change much (?) I'm concerned about these checks silently removing checks they shouldn't in the future.

(Who would notice if this happened?)

I don't have a good answer for this. I've tried to make sure we don't introduce any false negatives with this patch by covering all the cases I can think of, but it's possible we could have missed something. There are enough people using this feature that I think we'd be alerted to + fix false negatives.

It really seems it would be better to let InstCombine or some other analysis/transform deal with proving checks redundant instead of attempting to do so on-the-fly during CodeGen.

-O1/-O2 do get rid of a lot of checks, but they also degrade the debugging experience, so it's not really a solution for this use case.

Understood, that makes sense.

Would running InstCombine (and only InstCombine):

Fail to remove any checks elided by this change?

No, instcombine gets all of these.

Have a negative impact on debugging experience? For this I'm mostly asking for a guess, I don't know how to exactly quantify this easily.

Probably, but I'm not 100% sure. Instcombine can touch a fair amount of debug info.

(3) Have an undesirable impact on compilation time or other negative consequence?)

Instcombine is one of the slower llvm passes, IIRC. At any rate, the idea of modifying the -O0 pipeline when ubsan is enabled just to turn on instcombine doesn't seem palatable..

Can you better motivate why this is worth these costs, or explain your use case a bit more?

I have some numbers from LNT. I did a pre-patch and post-patch run at -O0 + -fsanitize=signed-integer-overflow,unsigned-integer-overflow. There were 4,672 object files produced in each run. This patch brings the average object size down from 36,472.0 to 36,378.3 bytes (a 0.26% improvement), and the average number of overflow checks per object down from 66.8 to 66.2 (a 0.81% improvement).

Wonderful, thank you for producing and sharing these numbers. Those improvements don't convince me, but if you're saying this is important to you and your use-case/users I'm happy to go with that.

Yeah, on average, the patch isn't a huge improvement. What makes it worthwhile (imo) is that the risk is also very low, and that it can pay to emit less IR (for the one person out there that wants to add a million shorts together).

Some context: we have a project that adds ~17,000 integers together in straight-line code (someone must have auto-generated the C code that does this ><...). The amount of add/sub overflow checks generated there brings clang to its knees at -O0, -Os, etc. We had to kill the build of this project. I get that it's not a representative example, but this is the kind of behavior I really don't want unsuspecting users to hit.

I don't have reliable compile-time numbers, but not emitting IR really seems like a straightforward improvement over emitting/analyzing/removing it.

Hard to say. Separation of concerns is important too, but of course there's trade-offs everywhere :). I'd suspect this doesn't change compile-time much either way.

So, those are the benefits. IMO getting close to 1% better at reducing instrumentation overhead is worth the complexity cost here.

Couldn't say, but that sounds reasonable to me and I don't mean to stand in the way of progress!

Can you answer my questions about InstCombine above? Thanks!

Ping, is the argument in favor of making the change in my last comment satisfactory?

Make the suggested readability improvements, and fix a comment in the test case.

Sorry for the delay!

LGTM, thanks!

This revision is now accepted and ready to land.Feb 24 2017, 4:41 PM

Closed by commit rL296213: [ubsan] Omit superflous overflow checks for promoted arithmetic (PR20193) (authored by vedantk). · Explain WhyFeb 24 2017, 4:55 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

lib/

CodeGen/

CGExprScalar.cpp

75 lines

test/

CodeGen/

compound-assign-overflow.c

4 lines

ubsan-promoted-arith.cpp

124 lines

unsigned-promotion.c

113 lines

Diff 89733

lib/CodeGen/CGExprScalar.cpp

Show All 18 Lines
#include "TargetInfo.h"		#include "TargetInfo.h"
#include "clang/AST/ASTContext.h"		#include "clang/AST/ASTContext.h"
#include "clang/AST/DeclObjC.h"		#include "clang/AST/DeclObjC.h"
#include "clang/AST/Expr.h"		#include "clang/AST/Expr.h"
#include "clang/AST/RecordLayout.h"		#include "clang/AST/RecordLayout.h"
#include "clang/AST/StmtVisitor.h"		#include "clang/AST/StmtVisitor.h"
#include "clang/Basic/TargetInfo.h"		#include "clang/Basic/TargetInfo.h"
#include "clang/Frontend/CodeGenOptions.h"		#include "clang/Frontend/CodeGenOptions.h"
		#include "llvm/ADT/Optional.h"
#include "llvm/IR/CFG.h"		#include "llvm/IR/CFG.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/GlobalVariable.h"		#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include <cstdarg>		#include <cstdarg>
Show All 18 Lines

static bool MustVisitNullValue(const Expr *E) {		static bool MustVisitNullValue(const Expr *E) {
// If a null pointer expression's type is the C++0x nullptr_t, then		// If a null pointer expression's type is the C++0x nullptr_t, then
// it's not necessarily a simple constant and it must be evaluated		// it's not necessarily a simple constant and it must be evaluated
// for its potential side effects.		// for its potential side effects.
return E->getType()->isNullPtrType();		return E->getType()->isNullPtrType();
}		}

		/// If \p E is a widened promoted integer, get its base (unpromoted) type.
		static llvm::Optional<QualType> getUnwidenedIntegerType(const ASTContext &Ctx,
		const Expr *E) {
		const Expr *Base = E->IgnoreImpCasts();
		if (E == Base)
		return llvm::None;

		QualType BaseTy = Base->getType();
		if (!BaseTy->isPromotableIntegerType() \|\|
		Ctx.getTypeSize(BaseTy) >= Ctx.getTypeSize(E->getType()))
		return llvm::None;

		efriedmaUnsubmitted Done Reply Inline Actions Checking isPromotableIntegerType doesn't work the way you want it to; types can be "promoted" without actually widening them. For example, enum types are promotable, and in C++ wchar_t is promotable. efriedma: Checking isPromotableIntegerType doesn't work the way you want it to; types can be "promoted"…
		vskAuthorUnsubmitted Not Done Reply Inline Actions Thanks for catching this! I have fixed the issue and will update this patch shortly. vsk: Thanks for catching this! I have fixed the issue and will update this patch shortly.
		return BaseTy;
		}

		/// Check if \p E is a widened promoted integer.
		static bool IsWidenedIntegerOp(const ASTContext &Ctx, const Expr *E) {
		return getUnwidenedIntegerType(Ctx, E).hasValue();
		}

		/// Check if we can skip the overflow check for \p Op.
		static bool CanElideOverflowCheck(const ASTContext &Ctx, const BinOpInfo &Op) {
		assert(isa<UnaryOperator>(Op.E) \|\|
		isa<BinaryOperator>(Op.E) && "Expected a unary or binary operator");

		if (const auto *UO = dyn_cast<UnaryOperator>(Op.E))
		return IsWidenedIntegerOp(Ctx, UO->getSubExpr());

		const auto *BO = cast<BinaryOperator>(Op.E);
		auto OptionalLHSTy = getUnwidenedIntegerType(Ctx, BO->getLHS());
		if (!OptionalLHSTy)
		return false;

		auto OptionalRHSTy = getUnwidenedIntegerType(Ctx, BO->getRHS());
		if (!OptionalRHSTy)
		return false;

		QualType LHSTy = *OptionalLHSTy;
		QualType RHSTy = *OptionalRHSTy;

		// We usually don't need overflow checks for binary operations with widened
		// operands. Multiplication with promoted unsigned operands is a special case.
		if ((Op.Opcode != BO_Mul && Op.Opcode != BO_MulAssign) \|\|
		!LHSTy->isUnsignedIntegerType() \|\| !RHSTy->isUnsignedIntegerType())
		return true;

		// The overflow check can be skipped if either one of the unpromoted types
		// are less than half the size of the promoted type.
		unsigned PromotedSize = Ctx.getTypeSize(Op.E->getType());
		return (2 * Ctx.getTypeSize(LHSTy)) < PromotedSize \|\|
		(2 * Ctx.getTypeSize(RHSTy)) < PromotedSize;
		}

class ScalarExprEmitter		class ScalarExprEmitter
: public StmtVisitor<ScalarExprEmitter, Value*> {		: public StmtVisitor<ScalarExprEmitter, Value*> {
CodeGenFunction &CGF;		CodeGenFunction &CGF;
CGBuilderTy &Builder;		CGBuilderTy &Builder;
bool IgnoreResultAssign;		bool IgnoreResultAssign;
llvm::LLVMContext &VMContext;		llvm::LLVMContext &VMContext;
public:		public:

▲ Show 20 Lines • Show All 408 Lines • ▼ Show 20 Lines	if (Ops.Ty->isSignedIntegerOrEnumerationType()) {
switch (CGF.getLangOpts().getSignedOverflowBehavior()) {		switch (CGF.getLangOpts().getSignedOverflowBehavior()) {
case LangOptions::SOB_Defined:		case LangOptions::SOB_Defined:
return Builder.CreateMul(Ops.LHS, Ops.RHS, "mul");		return Builder.CreateMul(Ops.LHS, Ops.RHS, "mul");
case LangOptions::SOB_Undefined:		case LangOptions::SOB_Undefined:
if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))		if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))
return Builder.CreateNSWMul(Ops.LHS, Ops.RHS, "mul");		return Builder.CreateNSWMul(Ops.LHS, Ops.RHS, "mul");
// Fall through.		// Fall through.
case LangOptions::SOB_Trapping:		case LangOptions::SOB_Trapping:
		if (CanElideOverflowCheck(CGF.getContext(), Ops))
		return Builder.CreateNSWMul(Ops.LHS, Ops.RHS, "mul");
return EmitOverflowCheckedBinOp(Ops);		return EmitOverflowCheckedBinOp(Ops);
}		}
}		}

if (Ops.Ty->isUnsignedIntegerType() &&		if (Ops.Ty->isUnsignedIntegerType() &&
CGF.SanOpts.has(SanitizerKind::UnsignedIntegerOverflow))		CGF.SanOpts.has(SanitizerKind::UnsignedIntegerOverflow) &&
		!CanElideOverflowCheck(CGF.getContext(), Ops))
return EmitOverflowCheckedBinOp(Ops);		return EmitOverflowCheckedBinOp(Ops);

if (Ops.LHS->getType()->isFPOrFPVectorTy())		if (Ops.LHS->getType()->isFPOrFPVectorTy())
return Builder.CreateFMul(Ops.LHS, Ops.RHS, "mul");		return Builder.CreateFMul(Ops.LHS, Ops.RHS, "mul");
return Builder.CreateMul(Ops.LHS, Ops.RHS, "mul");		return Builder.CreateMul(Ops.LHS, Ops.RHS, "mul");
}		}
/// Create a binary op that checks for overflow.		/// Create a binary op that checks for overflow.
/// Currently only supports +, - and *.		/// Currently only supports +, - and *.
▲ Show 20 Lines • Show All 1,159 Lines • ▼ Show 20 Lines	llvm::Value *ScalarExprEmitter::EmitIncDecConsiderOverflowBehavior(
switch (CGF.getLangOpts().getSignedOverflowBehavior()) {		switch (CGF.getLangOpts().getSignedOverflowBehavior()) {
case LangOptions::SOB_Defined:		case LangOptions::SOB_Defined:
return Builder.CreateAdd(InVal, Amount, Name);		return Builder.CreateAdd(InVal, Amount, Name);
case LangOptions::SOB_Undefined:		case LangOptions::SOB_Undefined:
if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))		if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))
return Builder.CreateNSWAdd(InVal, Amount, Name);		return Builder.CreateNSWAdd(InVal, Amount, Name);
// Fall through.		// Fall through.
case LangOptions::SOB_Trapping:		case LangOptions::SOB_Trapping:
		if (IsWidenedIntegerOp(CGF.getContext(), E->getSubExpr()))
		filcabUnsubmitted Not Done Reply Inline Actions Maybe a helper `IsWidenedIntegerOp(...)` (or `IsOpWiderThanBaseType`or something) would make this (and others, like the first return of `getUnwidenedIntegerType`) easier to read? filcab: Maybe a helper `IsWidenedIntegerOp(...)` (or `IsOpWiderThanBaseType`or something) would make…
		return Builder.CreateNSWAdd(InVal, Amount, Name);
return EmitOverflowCheckedBinOp(createBinOpInfoFromIncDec(E, InVal, IsInc));		return EmitOverflowCheckedBinOp(createBinOpInfoFromIncDec(E, InVal, IsInc));
}		}
llvm_unreachable("Unknown SignedOverflowBehaviorTy");		llvm_unreachable("Unknown SignedOverflowBehaviorTy");
}		}

llvm::Value *		llvm::Value *
ScalarExprEmitter::EmitScalarPrePostIncDec(const UnaryOperator *E, LValue LV,		ScalarExprEmitter::EmitScalarPrePostIncDec(const UnaryOperator *E, LValue LV,
bool isInc, bool isPre) {		bool isInc, bool isPre) {
▲ Show 20 Lines • Show All 602 Lines • ▼ Show 20 Lines	void ScalarExprEmitter::EmitUndefinedBehaviorIntegerDivAndRemCheck(
const BinOpInfo &Ops, llvm::Value *Zero, bool isDiv) {		const BinOpInfo &Ops, llvm::Value *Zero, bool isDiv) {
SmallVector<std::pair<llvm::Value *, SanitizerMask>, 2> Checks;		SmallVector<std::pair<llvm::Value *, SanitizerMask>, 2> Checks;

if (CGF.SanOpts.has(SanitizerKind::IntegerDivideByZero)) {		if (CGF.SanOpts.has(SanitizerKind::IntegerDivideByZero)) {
Checks.push_back(std::make_pair(Builder.CreateICmpNE(Ops.RHS, Zero),		Checks.push_back(std::make_pair(Builder.CreateICmpNE(Ops.RHS, Zero),
SanitizerKind::IntegerDivideByZero));		SanitizerKind::IntegerDivideByZero));
}		}

		const auto *BO = cast<BinaryOperator>(Ops.E);
if (CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow) &&		if (CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow) &&
Ops.Ty->hasSignedIntegerRepresentation()) {		Ops.Ty->hasSignedIntegerRepresentation() &&
		!IsWidenedIntegerOp(CGF.getContext(), BO->getLHS())) {
llvm::IntegerType *Ty = cast<llvm::IntegerType>(Zero->getType());		llvm::IntegerType *Ty = cast<llvm::IntegerType>(Zero->getType());

llvm::Value *IntMin =		llvm::Value *IntMin =
Builder.getInt(llvm::APInt::getSignedMinValue(Ty->getBitWidth()));		Builder.getInt(llvm::APInt::getSignedMinValue(Ty->getBitWidth()));
llvm::Value *NegOne = llvm::ConstantInt::get(Ty, -1ULL);		llvm::Value *NegOne = llvm::ConstantInt::get(Ty, -1ULL);

llvm::Value *LHSCmp = Builder.CreateICmpNE(Ops.LHS, IntMin);		llvm::Value *LHSCmp = Builder.CreateICmpNE(Ops.LHS, IntMin);
llvm::Value *RHSCmp = Builder.CreateICmpNE(Ops.RHS, NegOne);		llvm::Value *RHSCmp = Builder.CreateICmpNE(Ops.RHS, NegOne);
▲ Show 20 Lines • Show All 335 Lines • ▼ Show 20 Lines	if (op.Ty->isSignedIntegerOrEnumerationType()) {
switch (CGF.getLangOpts().getSignedOverflowBehavior()) {		switch (CGF.getLangOpts().getSignedOverflowBehavior()) {
case LangOptions::SOB_Defined:		case LangOptions::SOB_Defined:
return Builder.CreateAdd(op.LHS, op.RHS, "add");		return Builder.CreateAdd(op.LHS, op.RHS, "add");
case LangOptions::SOB_Undefined:		case LangOptions::SOB_Undefined:
if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))		if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))
return Builder.CreateNSWAdd(op.LHS, op.RHS, "add");		return Builder.CreateNSWAdd(op.LHS, op.RHS, "add");
// Fall through.		// Fall through.
case LangOptions::SOB_Trapping:		case LangOptions::SOB_Trapping:
		if (CanElideOverflowCheck(CGF.getContext(), op))
		return Builder.CreateNSWAdd(op.LHS, op.RHS, "add");
return EmitOverflowCheckedBinOp(op);		return EmitOverflowCheckedBinOp(op);
}		}
}		}

if (op.Ty->isUnsignedIntegerType() &&		if (op.Ty->isUnsignedIntegerType() &&
CGF.SanOpts.has(SanitizerKind::UnsignedIntegerOverflow))		CGF.SanOpts.has(SanitizerKind::UnsignedIntegerOverflow) &&
		!CanElideOverflowCheck(CGF.getContext(), op))
return EmitOverflowCheckedBinOp(op);		return EmitOverflowCheckedBinOp(op);

if (op.LHS->getType()->isFPOrFPVectorTy()) {		if (op.LHS->getType()->isFPOrFPVectorTy()) {
// Try to form an fmuladd.		// Try to form an fmuladd.
if (Value *FMulAdd = tryEmitFMulAdd(op, CGF, Builder))		if (Value *FMulAdd = tryEmitFMulAdd(op, CGF, Builder))
return FMulAdd;		return FMulAdd;

return Builder.CreateFAdd(op.LHS, op.RHS, "add");		return Builder.CreateFAdd(op.LHS, op.RHS, "add");
Show All 9 Lines	if (op.Ty->isSignedIntegerOrEnumerationType()) {
switch (CGF.getLangOpts().getSignedOverflowBehavior()) {		switch (CGF.getLangOpts().getSignedOverflowBehavior()) {
case LangOptions::SOB_Defined:		case LangOptions::SOB_Defined:
return Builder.CreateSub(op.LHS, op.RHS, "sub");		return Builder.CreateSub(op.LHS, op.RHS, "sub");
case LangOptions::SOB_Undefined:		case LangOptions::SOB_Undefined:
if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))		if (!CGF.SanOpts.has(SanitizerKind::SignedIntegerOverflow))
return Builder.CreateNSWSub(op.LHS, op.RHS, "sub");		return Builder.CreateNSWSub(op.LHS, op.RHS, "sub");
// Fall through.		// Fall through.
case LangOptions::SOB_Trapping:		case LangOptions::SOB_Trapping:
		if (CanElideOverflowCheck(CGF.getContext(), op))
		return Builder.CreateNSWSub(op.LHS, op.RHS, "sub");
return EmitOverflowCheckedBinOp(op);		return EmitOverflowCheckedBinOp(op);
}		}
}		}

if (op.Ty->isUnsignedIntegerType() &&		if (op.Ty->isUnsignedIntegerType() &&
CGF.SanOpts.has(SanitizerKind::UnsignedIntegerOverflow))		CGF.SanOpts.has(SanitizerKind::UnsignedIntegerOverflow) &&
		!CanElideOverflowCheck(CGF.getContext(), op))
return EmitOverflowCheckedBinOp(op);		return EmitOverflowCheckedBinOp(op);

if (op.LHS->getType()->isFPOrFPVectorTy()) {		if (op.LHS->getType()->isFPOrFPVectorTy()) {
// Try to form an fmuladd.		// Try to form an fmuladd.
if (Value *FMulAdd = tryEmitFMulAdd(op, CGF, Builder, true))		if (Value *FMulAdd = tryEmitFMulAdd(op, CGF, Builder, true))
return FMulAdd;		return FMulAdd;
return Builder.CreateFSub(op.LHS, op.RHS, "sub");		return Builder.CreateFSub(op.LHS, op.RHS, "sub");
}		}
▲ Show 20 Lines • Show All 967 Lines • Show Last 20 Lines

test/CodeGen/compound-assign-overflow.c

	Show All 19 Lines

	// CHECK: @compaddunsigned			// CHECK: @compaddunsigned
	void compaddunsigned() {			void compaddunsigned() {
	#line 200			#line 200
	x += ((uint32_t)1U);			x += ((uint32_t)1U);
	// CHECK: @__ubsan_handle_add_overflow(i8* bitcast ({{.}} @[[LINE_200]] to i8), {{.*}})			// CHECK: @__ubsan_handle_add_overflow(i8* bitcast ({{.}} @[[LINE_200]] to i8), {{.*}})
	}			}

	int8_t a, b;

	// CHECK: @compdiv			// CHECK: @compdiv
	void compdiv() {			void compdiv() {
	#line 300			#line 300
	a /= b;			x /= x;
	// CHECK: @__ubsan_handle_divrem_overflow(i8* bitcast ({{.}} @[[LINE_300]] to i8), {{.*}})			// CHECK: @__ubsan_handle_divrem_overflow(i8* bitcast ({{.}} @[[LINE_300]] to i8), {{.*}})
	}			}

test/CodeGen/ubsan-promoted-arith.cpp

This file was added.

				// RUN: %clang_cc1 -std=c++11 -triple x86_64-apple-darwin10 -emit-llvm -o - %s -fsanitize=signed-integer-overflow,unsigned-integer-overflow \| FileCheck %s

				typedef unsigned char uchar;
				typedef unsigned short ushort;

				enum E1 : int {
				a
				};

				enum E2 : char {
				b
				};

				// CHECK-LABEL: define signext i8 @_Z4add1
				// CHECK-NOT: sadd.with.overflow
				char add1(char c) { return c + c; }

				// CHECK-LABEL: define zeroext i8 @_Z4add2
				// CHECK-NOT: uadd.with.overflow
				uchar add2(uchar uc) { return uc + uc; }

				// CHECK-LABEL: define i32 @_Z4add3
				// CHECK: sadd.with.overflow
				int add3(E1 e) { return e + a; }

				// CHECK-LABEL: define signext i8 @_Z4add4
				// CHECK-NOT: sadd.with.overflow
				char add4(E2 e) { return e + b; }

				// CHECK-LABEL: define signext i8 @_Z4sub1
				// CHECK-NOT: ssub.with.overflow
				char sub1(char c) { return c - c; }

				// CHECK-LABEL: define zeroext i8 @_Z4sub2
				// CHECK-NOT: usub.with.overflow
				uchar sub2(uchar uc) { return uc - uc; }

				// CHECK-LABEL: define signext i8 @_Z4sub3
				// CHECK-NOT: ssub.with.overflow
				char sub3(char c) { return -c; }

				// Note: -INT_MIN can overflow.
				//
				// CHECK-LABEL: define i32 @_Z4sub4
				// CHECK: ssub.with.overflow
				int sub4(int i) { return -i; }

				// CHECK-LABEL: define signext i8 @_Z4mul1
				// CHECK-NOT: smul.with.overflow
				char mul1(char c) { return c * c; }

				// CHECK-LABEL: define zeroext i8 @_Z4mul2
				// CHECK-NOT: smul.with.overflow
				uchar mul2(uchar uc) { return uc * uc; }

				// Note: USHRT_MAX * USHRT_MAX can overflow.
				//
				filcabUnsubmitted Done Reply Inline Actions Nit: Maybe `USHRT_MAX * USHRT_MAX` is more understandable? filcab: Nit: Maybe `USHRT_MAX * USHRT_MAX` is more understandable?
				vskAuthorUnsubmitted Not Done Reply Inline Actions Yes, I'll fix this. vsk: Yes, I'll fix this.
				// CHECK-LABEL: define zeroext i16 @_Z4mul3
				// CHECK: smul.with.overflow
				ushort mul3(ushort us) { return us * us; }

				// CHECK-LABEL: define i32 @_Z4mul4
				// CHECK: smul.with.overflow
				int mul4(int i, char c) { return i * c; }

				// CHECK-LABEL: define i32 @_Z4mul5
				// CHECK: smul.with.overflow
				int mul5(int i, char c) { return c * i; }

				// CHECK-LABEL: define signext i16 @_Z4mul6
				// CHECK-NOT: smul.with.overflow
				short mul6(short s) { return s * s; }

				// CHECK-LABEL: define signext i8 @_Z4div1
				// CHECK-NOT: ubsan_handle_divrem_overflow
				char div1(char c) { return c / c; }

				// CHECK-LABEL: define zeroext i8 @_Z4div2
				// CHECK-NOT: ubsan_handle_divrem_overflow
				uchar div2(uchar uc) { return uc / uc; }

				// CHECK-LABEL: define signext i8 @_Z4div3
				// CHECK-NOT: ubsan_handle_divrem_overflow
				char div3(char c, int i) { return c / i; }

				// CHECK-LABEL: define signext i8 @_Z4div4
				// CHECK: ubsan_handle_divrem_overflow
				char div4(int i, char c) { return i / c; }

				// Note: INT_MIN / -1 can overflow.
				//
				filcabUnsubmitted Not Done Reply Inline Actions Extra `-`. `INT_MIN/-1` is what you want. You already have a test above for `-INT_MIN` (which would overflow before the division. filcab: Extra `-`. `INT_MIN/-1` is what you want. You already have a test above for `-INT_MIN` (which…
				// CHECK-LABEL: define signext i8 @_Z4div5
				// CHECK: ubsan_handle_divrem_overflow
				char div5(int i, char c) { return i / c; }

				// CHECK-LABEL: define signext i8 @_Z4rem1
				// CHECK-NOT: ubsan_handle_divrem_overflow
				char rem1(char c) { return c % c; }

				// CHECK-LABEL: define zeroext i8 @_Z4rem2
				filcabUnsubmitted Not Done Reply Inline Actions Maybe put the rdar ID next to the FIXME? Like this it looks like you might have written that instead of `CHECK` by mistake. filcab: Maybe put the rdar ID next to the FIXME? Like this it looks like you might have written that…
				vskAuthorUnsubmitted Not Done Reply Inline Actions I placed the rdar ID here to communicate that the line should become a CHECK line once the bug is fixed. It will go away with D29437. vsk: I placed the rdar ID here to communicate that the line should become a CHECK line once the bug…
				// CHECK-NOT: ubsan_handle_divrem_overflow
				uchar rem2(uchar uc) { return uc % uc; }

				// FIXME: This is a long-standing false negative.
				//
				// CHECK-LABEL: define signext i8 @_Z4rem3
				// rdar30301609: ubsan_handle_divrem_overflow
				char rem3(int i, char c) { return i % c; }

				// CHECK-LABEL: define signext i8 @_Z4inc1
				// CHECK-NOT: sadd.with.overflow
				char inc1(char c) { return c++ + (char)0; }

				// CHECK-LABEL: define zeroext i8 @_Z4inc2
				// CHECK-NOT: uadd.with.overflow
				uchar inc2(uchar uc) { return uc++ + (uchar)0; }

				// CHECK-LABEL: define void @_Z4inc3
				// CHECK-NOT: sadd.with.overflow
				void inc3(char c) { c++; }

				// CHECK-LABEL: define void @_Z4inc4
				// CHECK-NOT: uadd.with.overflow
				void inc4(uchar uc) { uc++; }

test/CodeGen/unsigned-promotion.c

	// Check -fsanitize=signed-integer-overflow and			// Check -fsanitize=signed-integer-overflow and
	// -fsanitize=unsigned-integer-overflow with promoted unsigned types			// -fsanitize=unsigned-integer-overflow with promoted unsigned types
	//			//
	// RUN: %clang_cc1 -triple x86_64-apple-darwin10 -emit-llvm -o - %s \			// RUN: %clang_cc1 -triple x86_64-apple-darwin10 -emit-llvm -o - %s \
	// RUN: -fsanitize=signed-integer-overflow \| FileCheck %s --check-prefix=CHECKS			// RUN: -fsanitize=signed-integer-overflow \| FileCheck %s --check-prefix=CHECKS
	// RUN: %clang_cc1 -triple x86_64-apple-darwin10 -emit-llvm -o - %s \			// RUN: %clang_cc1 -triple x86_64-apple-darwin10 -emit-llvm -o - %s \
	// RUN: -fsanitize=unsigned-integer-overflow \| FileCheck %s --check-prefix=CHECKU			// RUN: -fsanitize=unsigned-integer-overflow \| FileCheck %s --check-prefix=CHECKU

	unsigned short si, sj, sk;			unsigned short si, sj, sk;
	unsigned char ci, cj, ck;

	extern void opaqueshort(unsigned short);
	extern void opaquechar(unsigned char);

	// CHECKS-LABEL: define void @testshortadd()
	// CHECKU-LABEL: define void @testshortadd()
	void testshortadd() {
	// CHECKS: load i16, i16* @sj
	// CHECKS: load i16, i16* @sk
	// CHECKS: [[T1:%.]] = call { i32, i1 } @llvm.sadd.with.overflow.i32(i32 [[T2:%.]], i32 [[T3:%.*]])
	// CHECKS-NEXT: [[T4:%.*]] = extractvalue { i32, i1 } [[T1]], 0
	// CHECKS-NEXT: [[T5:%.*]] = extractvalue { i32, i1 } [[T1]], 1
	// CHECKS: call void @__ubsan_handle_add_overflow
	//
	// CHECKU: [[T1:%.]] = load i16, i16 @sj
	// CHECKU: [[T2:%.*]] = zext i16 [[T1]]
	// CHECKU: [[T3:%.]] = load i16, i16 @sk
	// CHECKU: [[T4:%.*]] = zext i16 [[T3]]
	// CHECKU-NOT: llvm.sadd
	// CHECKU-NOT: llvm.uadd
	// CHECKU: [[T5:%.*]] = add nsw i32 [[T2]], [[T4]]

	si = sj + sk;
	}

	// CHECKS-LABEL: define void @testshortsub()
	// CHECKU-LABEL: define void @testshortsub()
	void testshortsub() {

	// CHECKS: load i16, i16* @sj
	// CHECKS: load i16, i16* @sk
	// CHECKS: [[T1:%.]] = call { i32, i1 } @llvm.ssub.with.overflow.i32(i32 [[T2:%.]], i32 [[T3:%.*]])
	// CHECKS-NEXT: [[T4:%.*]] = extractvalue { i32, i1 } [[T1]], 0
	// CHECKS-NEXT: [[T5:%.*]] = extractvalue { i32, i1 } [[T1]], 1
	// CHECKS: call void @__ubsan_handle_sub_overflow
	//
	// CHECKU: [[T1:%.]] = load i16, i16 @sj
	// CHECKU: [[T2:%.*]] = zext i16 [[T1]]
	// CHECKU: [[T3:%.]] = load i16, i16 @sk
	// CHECKU: [[T4:%.*]] = zext i16 [[T3]]
	// CHECKU-NOT: llvm.ssub
	// CHECKU-NOT: llvm.usub
	// CHECKU: [[T5:%.*]] = sub nsw i32 [[T2]], [[T4]]

	si = sj - sk;
	}

	// CHECKS-LABEL: define void @testshortmul()			// CHECKS-LABEL: define void @testshortmul()
	// CHECKU-LABEL: define void @testshortmul()			// CHECKU-LABEL: define void @testshortmul()
	void testshortmul() {			void testshortmul() {

	// CHECKS: load i16, i16* @sj			// CHECKS: load i16, i16* @sj
	// CHECKS: load i16, i16* @sk			// CHECKS: load i16, i16* @sk
	// CHECKS: [[T1:%.]] = call { i32, i1 } @llvm.smul.with.overflow.i32(i32 [[T2:%.]], i32 [[T3:%.*]])			// CHECKS: [[T1:%.]] = call { i32, i1 } @llvm.smul.with.overflow.i32(i32 [[T2:%.]], i32 [[T3:%.*]])
	// CHECKS-NEXT: [[T4:%.*]] = extractvalue { i32, i1 } [[T1]], 0			// CHECKS-NEXT: [[T4:%.*]] = extractvalue { i32, i1 } [[T1]], 0
	// CHECKS-NEXT: [[T5:%.*]] = extractvalue { i32, i1 } [[T1]], 1			// CHECKS-NEXT: [[T5:%.*]] = extractvalue { i32, i1 } [[T1]], 1
	// CHECKS: call void @__ubsan_handle_mul_overflow			// CHECKS: call void @__ubsan_handle_mul_overflow
	//			//
	// CHECKU: [[T1:%.]] = load i16, i16 @sj			// CHECKU: [[T1:%.]] = load i16, i16 @sj
	// CHECKU: [[T2:%.*]] = zext i16 [[T1]]			// CHECKU: [[T2:%.*]] = zext i16 [[T1]]
	// CHECKU: [[T3:%.]] = load i16, i16 @sk			// CHECKU: [[T3:%.]] = load i16, i16 @sk
	// CHECKU: [[T4:%.*]] = zext i16 [[T3]]			// CHECKU: [[T4:%.*]] = zext i16 [[T3]]
	// CHECKU-NOT: llvm.smul			// CHECKU-NOT: llvm.smul
	// CHECKU-NOT: llvm.umul			// CHECKU-NOT: llvm.umul
	// CHECKU: [[T5:%.*]] = mul nsw i32 [[T2]], [[T4]]			// CHECKU: [[T5:%.*]] = mul nsw i32 [[T2]], [[T4]]
	si = sj * sk;			si = sj * sk;
	}			}

	// CHECKS-LABEL: define void @testcharadd()
	// CHECKU-LABEL: define void @testcharadd()
	void testcharadd() {

	// CHECKS: load i8, i8* @cj
	// CHECKS: load i8, i8* @ck
	// CHECKS: [[T1:%.]] = call { i32, i1 } @llvm.sadd.with.overflow.i32(i32 [[T2:%.]], i32 [[T3:%.*]])
	// CHECKS-NEXT: [[T4:%.*]] = extractvalue { i32, i1 } [[T1]], 0
	// CHECKS-NEXT: [[T5:%.*]] = extractvalue { i32, i1 } [[T1]], 1
	// CHECKS: call void @__ubsan_handle_add_overflow
	//
	// CHECKU: [[T1:%.]] = load i8, i8 @cj
	// CHECKU: [[T2:%.*]] = zext i8 [[T1]]
	// CHECKU: [[T3:%.]] = load i8, i8 @ck
	// CHECKU: [[T4:%.*]] = zext i8 [[T3]]
	// CHECKU-NOT: llvm.sadd
	// CHECKU-NOT: llvm.uadd
	// CHECKU: [[T5:%.*]] = add nsw i32 [[T2]], [[T4]]

	ci = cj + ck;
	}

	// CHECKS-LABEL: define void @testcharsub()
	// CHECKU-LABEL: define void @testcharsub()
	void testcharsub() {

	// CHECKS: load i8, i8* @cj
	// CHECKS: load i8, i8* @ck
	// CHECKS: [[T1:%.]] = call { i32, i1 } @llvm.ssub.with.overflow.i32(i32 [[T2:%.]], i32 [[T3:%.*]])
	// CHECKS-NEXT: [[T4:%.*]] = extractvalue { i32, i1 } [[T1]], 0
	// CHECKS-NEXT: [[T5:%.*]] = extractvalue { i32, i1 } [[T1]], 1
	// CHECKS: call void @__ubsan_handle_sub_overflow
	//
	// CHECKU: [[T1:%.]] = load i8, i8 @cj
	// CHECKU: [[T2:%.*]] = zext i8 [[T1]]
	// CHECKU: [[T3:%.]] = load i8, i8 @ck
	// CHECKU: [[T4:%.*]] = zext i8 [[T3]]
	// CHECKU-NOT: llvm.ssub
	// CHECKU-NOT: llvm.usub
	// CHECKU: [[T5:%.*]] = sub nsw i32 [[T2]], [[T4]]

	ci = cj - ck;
	}

	// CHECKS-LABEL: define void @testcharmul()
	// CHECKU-LABEL: define void @testcharmul()
	void testcharmul() {

	// CHECKS: load i8, i8* @cj
	// CHECKS: load i8, i8* @ck
	// CHECKS: [[T1:%.]] = call { i32, i1 } @llvm.smul.with.overflow.i32(i32 [[T2:%.]], i32 [[T3:%.*]])
	// CHECKS-NEXT: [[T4:%.*]] = extractvalue { i32, i1 } [[T1]], 0
	// CHECKS-NEXT: [[T5:%.*]] = extractvalue { i32, i1 } [[T1]], 1
	// CHECKS: call void @__ubsan_handle_mul_overflow
	//
	// CHECKU: [[T1:%.]] = load i8, i8 @cj
	// CHECKU: [[T2:%.*]] = zext i8 [[T1]]
	// CHECKU: [[T3:%.]] = load i8, i8 @ck
	// CHECKU: [[T4:%.*]] = zext i8 [[T3]]
	// CHECKU-NOT: llvm.smul
	// CHECKU-NOT: llvm.umul
	// CHECKU: [[T5:%.*]] = mul nsw i32 [[T2]], [[T4]]

	ci = cj * ck;
	}