This is an archive of the discontinued LLVM Phabricator instance.

Differential D48958

[clang][ubsan] Implicit Cast Sanitizer - integer truncation - clang part
ClosedPublic

Authored by lebedev.ri on Jul 5 2018, 1:18 AM.

Details

Reviewers

rjmccall
rsmith
samsonov
pcc
vsk
eugenis
efriedma
kcc
erichkeane

Commits

rGb69ba22773e0: [clang][ubsan] Implicit Conversion Sanitizer - integer truncation - clang part
rL338288: [clang][ubsan] Implicit Conversion Sanitizer - integer truncation - clang part
rC338288: [clang][ubsan] Implicit Conversion Sanitizer - integer truncation - clang part

Summary

C and C++ are interesting languages. They are statically typed, but weakly.
Implicit conversions are allowed. This is nice: it lets you write code
that strikes a balance between everything being convertible
and nothing being convertible. As usual, this comes at a price:

unsigned char store = 0;

bool consume(unsigned int val);

void test(unsigned long val) {
  if (consume(val)) {
    // the 'val' is `unsigned long`, but `consume()` takes `unsigned int`.
    // If their bit widths are different on this platform, the implicit
    // truncation happens. And if that `unsigned long` had a value bigger
    // than UINT_MAX, then you may or may not have a bug.

    // Similarly, integer addition happens on `int`s, so `store` will
    // be promoted to an `int`, the sum calculated (0+768=768),
    // and the result demoted to `unsigned char`, and stored to `store`.
    // In this case, the `store` will still be 0. Again, not always intended.
    store = store + 768; // before addition, 'store' was promoted to int.
  }

  // But yes, sometimes this is intentional.
  // You can either make the conversion explicit
  (void)consume((unsigned int)val);
  // or mask the value so no bits will be *implicitly* lost.
  (void)consume((~((unsigned int)0)) & val);
}

Yes, there is a -Wconversion diagnostic group, but first, it is kinda
noisy, since it warns on everything (unlike sanitizers, which warn only on
actual issues), and second, there are cases where it does not warn.
So a Sanitizer is needed. I don't have any motivational numbers, but i know
i had this kind of problem 10-20 times, and it was never easy to track down.

The logic to detect whether a truncation has happened is pretty simple
if you think about it - https://godbolt.org/g/NEzXbb - basically, just
extend (using the new, not original!, signedness) the 'truncated' value
back to its original width, and equality-compare it with the original value.
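
The check described above can be sketched in plain C++. This is only an illustration of the idea, not the IR the patch emits; the helper name `truncationChangesValue` and the function `summaryExample` are hypothetical. Before C++20, converting an out-of-range value to a signed type is implementation-defined; the sketch assumes the usual two's-complement wraparound.

```cpp
#include <cassert>

// Hypothetical helper, not taken from the patch: returns true when demoting
// `original` to the narrower type Dst and widening the result back to Src
// does not reproduce the original value.
template <typename Dst, typename Src>
bool truncationChangesValue(Src original) {
  static_assert(sizeof(Dst) < sizeof(Src), "Dst must be narrower than Src");
  const Dst truncated = static_cast<Dst>(original);  // the demotion
  // Converting back sign-extends when Dst is signed and zero-extends when
  // Dst is unsigned, i.e. it uses the *new* signedness.
  const Src extended = static_cast<Src>(truncated);
  return extended != original;
}

// Values from the summary's example: 0 + 768 no longer fits `unsigned char`.
inline void summaryExample() {
  assert((truncationChangesValue<unsigned char, int>(768)));   // 768 -> 0
  assert((!truncationChangesValue<unsigned char, int>(255)));  // 255 survives
}
```

Using the destination's signedness for the widening step is what makes `-1` stored into a signed narrower type pass, while `-1` stored into an unsigned narrower type is reported.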

The most non-trivial thing here is the logic to detect whether this
ImplicitCastExpr AST node is actually an implicit conversion, or
part of an explicit cast. Because the explicit casts are modeled as an outer
ExplicitCastExpr with some ImplicitCastExpr's as direct children.
https://godbolt.org/g/eE1GkJ

Nowadays, we can just use the new part_of_explicit_cast flag, which is set
on all the implicitly-added ImplicitCastExpr's of an ExplicitCastExpr.
So if that flag is not set, then it is an actual implicit conversion.

As you may have noted, this isn't just named -fsanitize=implicit-integer-truncation.
There are potentially some more implicit conversions to be warned about.
Namely, implicit conversions that result in sign change; implicit conversion
between different floating point types, or between fp and an integer,
when again, that conversion is lossy.

One thing i know isn't handled is bitfields.

This is a clang part.
The compiler-rt part is D48959.

Fixes PR21530, PR37552, PR35409.
Partially fixes PR9821.
Fixes https://github.com/google/sanitizers/issues/940. (other than sign-changing implicit conversions)

Diff Detail

Repository: rC Clang

Event Timeline

lebedev.ri created this revision. Jul 5 2018, 1:18 AM

lebedev.ri created this object with visibility "All Users".

lebedev.ri mentioned this in D48959: [compiler-rt][ubsan] Implicit Cast Sanitizer - integer truncation - compiler-rt part.

lebedev.ri edited the summary of this revision. Jul 5 2018, 1:29 AM

The most non-trivial thing here is the logic to detect whether this
ImplicitCastExpr AST node is actually an implicit cast, or part of
an explicit cast. Because the explicit casts are modeled as an outer
ExplicitCastExpr with some ImplicitCastExpr's as direct children.
https://godbolt.org/g/eE1GkJ

It would seem, we can just check that the current ImplicitCastExpr
we are processing either has no CastExpr parents, or all of them are
ImplicitCastExpr.

As you may have noted, this isn't just named -fsanitize=implicit-integer-truncation.
There are potentially some more implicit casts to be warned about.
Namely, implicit casts that result in sign change; implicit cast
between different floating point types, or between fp and an integer,
when again, that conversion is lossy.

I suspect, there may be some false-negatives, cases yet to be handled.

This is a clang part.
The compiler-rt part is D48959.

Fixes PR21530, PR37552, PR35409.
Partially fixes PR9821.
Fixes https://github.com/google/sanitizers/issues/940.

lebedev.ri changed the visibility from "All Users" to "Public (No Login Required)". Jul 9 2018, 7:27 AM

Finished running it on a normal testset of my pet project.

It fired ~18 times.
There were no obvious false-positives (e.g. when an explicit cast was involved).
At least 3 of those appear to be true bugs.
4-5 more are probably bugs, but it is hard to tell.
Last 10-11 appear to be mostly OK intentional truncating casts.

This was on a normal test set, i suspect fuzzing will reveal more.

Check that sanitizer is actually enabled before doing the AST upwalk. I didn't measure, but it would be logical for this to be better.

Thanks for working on this!

docs/ReleaseNotes.rst
311	Could you mention whether the group is enabled by -fsanitize=undefined?
lib/CodeGen/CGExprScalar.cpp
314–315	I think the number of overloads here is really unwieldy. There should be a simpler way to structure this. What about consolidating all four overloads into one? Maybe:

struct ScalarConversionOpts {
  bool TreatBoolAsUnsigned = false;
  bool EmitImplicitIntegerTruncationCheck = false;
};
Value *EmitScalarConversion(Src, SrcTy, DstTy, Loc, Opts = ScalarConversionOpts())

It's not necessary to pass CastExpr in, right? There's only one place where that's done. It seems simpler to just do the SanOpts / isCastPartOfExplicitCast checking there.
951	nit, function names typically begin with a verb: 'isCastPartOf...'
1641	I think maintaining a stack of visited cast exprs in the emitter would be cheaper/simpler than using ASTContext::getParents. You could push CE here and use a RAII helper to pop it. The 'isCastPartOfExplicitCast' check then simplifies to a quick stack traversal.

@vsk thank you for taking a look!

Addressed the trivial part of nits.

lebedev.ri added inline comments. Jul 10 2018, 2:51 PM

lib/CodeGen/CGExprScalar.cpp
314–315	The number of overloads is indeed unwieldy.
1641	Hmm, two things come to mind: (1) This pessimizes the (most popular) case when the sanitizer is disabled. (2) `ASTContext::getParents()` may return more than one parent. I'm not sure if that matters here? I'll take a look..

lebedev.ri added a parent revision: D49179: [InstCombine] Fold x & (-1 >> y) == x to x u<= (-1 >> y). Jul 11 2018, 12:14 PM

vsk added inline comments. Jul 11 2018, 12:46 PM

lib/CodeGen/CGExprScalar.cpp
1641	As for (1), it's not necessary to push/pop the stack when this sanitizer is disabled. And for (2), IIUC the only interesting case is "explicit-cast <implicit-cast>+", and none of the implicit casts here have more than one parent.

lebedev.ri added inline comments. Jul 11 2018, 2:08 PM

lib/CodeGen/CGExprScalar.cpp
1641	"I think maintaining a stack of visited cast exprs in the emitter be cheaper/simpler than using ASTContext::getParents." (Attachment: broken.patch) So yeah, this could work. Except sadly it breaks in subtle cases like https://godbolt.org/g/5V2czU (I have added those tests beforehand.) Is `ASTContext::getParents()` really so horribly slow that we want to duplicate/maintain/track the current AST stack ourselves? If so, we will need to maintain the entire stack, not just `CastExpr`s...

Add some more tricky tests where maintaining just the CastExpr part of AST stack would break them.

vsk added a subscriber: klimek. Jul 11 2018, 3:48 PM

vsk added inline comments.

lib/CodeGen/CGExprScalar.cpp
1641	I think the scan in 'IsTopCastPartOfExplicitCast' can be fixed: while traversing backwards, you'd need to check that the previously-visited cast expr is the child of the current expr. That should address the false negative you pointed out in interference1. I don't yet see what the issue is with interference0. Could you explain why maintaining a stack of unfinished casts wouldn't work? I don't understand why you'd need the entire stack. My sense is that it's not required to match the "explicit-cast <implicit-cast>+" pattern, but I could easily be missing something here. As for why this might be worth looking into, I think scanning for an explicit cast is much easier to understand when working with a stack. + @klimek to comment on what to expect in terms of the overhead of ASTContext::getParents. Regardless of what approach we pick, it would help to see pre/post-patch compile times for a stage2 build of something like clang or llc.

Address @vsk's review notes.

Maintain the stack of currently-being-visited CastExpr's
Use that stack to check whether we are in an ExplicitCastExpr
Move logic for deciding whether to emit the check out of EmitScalarConversion()
Condense all overloads of EmitScalarConversion() down to one.

lib/CodeGen/CGExprScalar.cpp
1641	"I think the scan in 'IsTopCastPartOfExplicitCast' can be fixed: while traversing backwards, you'd need to check that the previously-visited cast expr is the child of the current expr. That should address the false negative you pointed out in interference1." Oh right. That seems to fix the issues. "Could you explain why maintaining a stack of unfinished casts wouldn't work?" I didn't think it wouldn't work. I just missed the tidbit about checking children. Then it works.
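
A minimal sketch of the stack scan discussed in this thread, using toy types rather than clang's real AST classes. The `Node` struct and this `isTopCastPartOfExplicitCast` function are hypothetical stand-ins written for illustration; the sketch assumes the stack holds only the casts that are still being visited, outermost first.

```cpp
#include <cassert>
#include <vector>

// Toy stand-in for an AST node; names and layout are hypothetical.
struct Node {
  enum Kind { ImplicitCast, ExplicitCast, Other } kind;
  const Node *sub;  // the single operand, or nullptr for a leaf
};

// Walk outwards from the cast on top of the stack. Continue only while each
// outer cast directly wraps the cast visited just before it.
bool isTopCastPartOfExplicitCast(const std::vector<const Node *> &stack) {
  assert(!stack.empty() && "need the cast being inspected");
  const Node *previous = stack.back();
  for (auto it = stack.rbegin() + 1; it != stack.rend(); ++it) {
    const Node *current = *it;
    if (current->sub != previous)
      return false;  // an unrelated, still-unfinished cast: stop the scan
    if (current->kind == Node::ExplicitCast)
      return true;   // matched "explicit-cast <implicit-cast>+"
    previous = current;
  }
  return false;
}
```

The child check is what separates an implicit cast sitting directly under an explicit cast from one that merely appears somewhere inside the explicit cast's operand, for example as an argument of a call.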

I have some minor comments but overall I think this is in good shape. It would be great to see some compile-time numbers just to make sure this is tractable. I'm pretty sure -fsanitize=null would fire more often across a codebase than this check, so I don't anticipate a big surprise here.

lib/CodeGen/CGExprScalar.cpp
220	It would help to have this comment explain that the stack is used/maintained exclusively by the implicit cast sanitizer.
226	Could you make this comment more specific -- maybe by explaining that for efficiency reasons, the cast expr stack is only maintained when a sanitizer check is enabled?
232	I think if you were to use references instead of pointers here, the code would be a bit clearer, and you wouldn't need to assert that CE is non-null.
316	Why not use default member initializers here (e.g, "bool a = false")?
957	The none_of call could safely be replaced by `Cast->getSubExpr() != PreviousCast`, I think.

Thank you for taking a look!

In D48958#1160381, @vsk wrote:

I have some minor comments but overall I think this is in good shape. It would be great to see some compile-time numbers just to make sure this is tractable. I'm pretty sure -fsanitize=null would fire more often across a codebase than this check, so I don't anticipate a big surprise here.

Could you please clarify, which numbers are you looking for, specifically?
The time it takes to build llvm stage2 with -fsanitize=implicit-cast?
Or the time it takes to build llvm stage3 with compiler built with -fsanitize=implicit-cast?
(The numbers won't be too representative, whole stage-1 takes ~40 minutes here...)

lib/CodeGen/CGExprScalar.cpp
316	I'll double-check, but i'm pretty sure there were some warnings when i did that. Or the default needs to be defined in the actual declaration of `EmitScalarConversion()`, i think.

In D48958#1160435, @lebedev.ri wrote:

Could you please clarify, which numbers are you looking for, specifically?
The time it takes to build llvm stage2 with -fsanitize=implicit-cast?
Or the time it takes to build llvm stage3 with compiler built with -fsanitize=implicit-cast?

I had in mind measuring the difference between -fsanitize=undefined and -fsanitize=undefined,implicit-cast, with a stage2 compiler. I think that captures the expected use case: existing ubsan users enabling this new check.

(The numbers won't be too representative, whole stage-1 takes ~40 minutes here...)

Ah I see, I'll run a few builds and take a stab at it, then.

In D48958#1160479, @vsk wrote:

I had in mind measuring the difference between -fsanitize=undefined and -fsanitize=undefined,implicit-cast, with a stage2 compiler. I think that captures the expected use case: existing ubsan users enabling this new check.

FWIW, i'm trying to look into optimizing these new IR patterns right now D49179 D49247.

(The numbers won't be too representative, whole stage-1 takes ~40 minutes here...)

Ah I see, I'll run a few builds and take a stab at it, then.

Yes, please, thank you!

In D48958#1160494, @lebedev.ri wrote:

Yes, please, thank you!

The stage2 build traps before it finishes:

FAILED: lib/IR/AttributesCompatFunc.inc.tmp
cd /Users/vsk/src/builds/llvm.org-lldbsan-stage2-R/tools/clang/stage2-bins && /Users/vsk/src/builds/llvm.org-lldbsan-stage2-R/tools/clang/stage2-bins/bin/llvm-tblgen -gen-attrs -I /Users/vsk/src/llvm.org-lldbsan/llvm/lib/IR -I /Users/vsk/src/llvm.org-lldbsan/llvm/include /Users/vsk/src/llvm.org-lldbsan/llvm/lib/IR/AttributesCompatFunc.td -o lib/IR/AttributesCompatFunc.inc.tmp -d lib/IR/AttributesCompatFunc.inc.d
/Users/vsk/src/llvm.org-lldbsan/llvm/include/llvm/ADT/DenseMap.h:732:66: runtime error: implicit cast from type 'uint64_t' (aka 'unsigned long long') of value 4294967296 (64-bit, unsigned) to type 'unsigned int' changed the value to 0 (32-bit, unsigned)
/bin/sh: line 1: 96848 Abort trap: 6

This looks like a false positive to me. It's complaining about static_cast<unsigned>(NextPowerOf2(...)), but the static_cast is explicit.

In D48958#1160848, @vsk wrote:
<...>
This looks like a false positive to me. It's complaining about static_cast<unsigned>(NextPowerOf2(...)), but the static_cast is explicit.

Good to know, so the stack-based logic for ExplicitCastExpr detection needs further tests/refinements..

In D48958#1160853, @lebedev.ri wrote:
In D48958#1160848, @vsk wrote:
<...>
Good to know, so the stack-based logic for ExplicitCastExpr detection needs further tests/refinements..

creduced down to:

template <typename a> a b(a c, const a &d) {
  if (d)
    ;
  return c;
}
int e = b<unsigned>(4, static_cast<unsigned>(4294967296));
int main() {}

https://godbolt.org/g/1kwGk9

$ ./a.out 
test.cpp:6:46: runtime error: implicit cast from type 'long' of value 4294967296 (64-bit, signed) to type 'unsigned int' changed the value to 0 (32-bit, unsigned)
    #0 0x232f56 in _GLOBAL__sub_I_test.cpp (/home/lebedevri/CREDUCE/a.out+0x232f56)
    #1 0x232fbc in __libc_csu_init (/home/lebedevri/CREDUCE/a.out+0x232fbc)
    #2 0x7fa8c113aaa7 in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x22aa7)
    #3 0x212029 in _start (/home/lebedevri/CREDUCE/a.out+0x212029)

@vsk so yeah, no wonder that doesn't work.
Somehow in that test case ScalarExprEmitter::VisitExplicitCastExpr() never gets called.
(I'm pretty sure this worked with the naive implementation, so worst case i'll just revert the 'stack' code)
Trying to assess the issue..

lib/CodeGen/CGExprScalar.cpp

316

[2/14 0.3/sec] Building CXX object tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o
FAILED: tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o 
/usr/bin/clang++-6.0  -DGTEST_HAS_RTTI=0 -D_DEBUG -D_GNU_SOURCE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -Itools/clang/lib/CodeGen -I/build/clang/lib/CodeGen -I/build/clang/include -Itools/clang/include -I/usr/include/libxml2 -Iinclude -I/build/llvm/include -g0 -fPIC -fvisibility-inlines-hidden -Werror -Werror=date-time -Werror=unguarded-availability-new -std=c++11 -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wcovered-switch-default -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wstring-conversion -fdiagnostics-color -ffunction-sections -fdata-sections -fno-common -Woverloaded-virtual -Wno-nested-anon-types -O3 -g0  -fPIC   -UNDEBUG  -fno-exceptions -fno-rtti -MD -MT tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o -MF tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o.d -o tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o -c /build/clang/lib/CodeGen/CGExprScalar.cpp
/build/clang/lib/CodeGen/CGExprScalar.cpp:355:52: error: default member initializer for 'TreatBooleanAsSigned' needed within definition of enclosing class 'ScalarExprEmitter' outside of member functions
                       ScalarConversionOpts Opts = ScalarConversionOpts());
                                                   ^
/build/clang/lib/CodeGen/CGExprScalar.cpp:349:10: note: default member initializer declared here
    bool TreatBooleanAsSigned = false;
         ^
/build/clang/lib/CodeGen/CGExprScalar.cpp:355:52: error: default member initializer for 'EmitImplicitIntegerTruncationChecks' needed within definition of enclosing class 'ScalarExprEmitter' outside of member functions
                       ScalarConversionOpts Opts = ScalarConversionOpts());
                                                   ^
/build/clang/lib/CodeGen/CGExprScalar.cpp:350:10: note: default member initializer declared here
    bool EmitImplicitIntegerTruncationChecks = false;
         ^
2 errors generated.
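
One possible way around the error in the log above, sketched under assumptions: the class `Emitter`, the struct name `Opts`, and the function `convert` are hypothetical stand-ins, and this is not claimed to be what the patch ended up doing. Giving the nested struct a user-provided constructor, instead of default member initializers, lets `Opts()` appear in a default argument inside the enclosing class.

```cpp
#include <cassert>

// Hypothetical miniature of the situation in the build log.
class Emitter {
public:
  struct Opts {
    bool TreatBooleanAsSigned;
    bool EmitImplicitIntegerTruncationChecks;
    // The constructor initializes the members itself, so no default member
    // initializer has to be available while Emitter is still being defined.
    Opts()
        : TreatBooleanAsSigned(false),
          EmitImplicitIntegerTruncationChecks(false) {}
  };

  // Stand-in for EmitScalarConversion(); reports whether a check is wanted.
  bool convert(int value, Opts opts = Opts()) const {
    (void)value;
    return opts.EmitImplicitIntegerTruncationChecks;
  }
};
```

Another option would be to define the options struct outside the class, where its default member initializers are complete before the default argument is parsed.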

Address @vsk review notes, although this will be reverted by the next update dropping the faulty 'stack' optimization.

Well, that's just great, with isCastPartOfExplictCast(), the ASTContext::getParents()
also does not return CXXStaticCastExpr as parent for such cases.
I don't know how to proceed.

lebedev.ri added a parent revision: D49320: [InstCombine] Fold 'check for [no] signed truncation' pattern. Jul 16 2018, 9:50 AM

Breakthrough: no more false-positives due to the MaterializeTemporaryExpr skipping over NoOp casts. (D49508)
Slight docs update.

Ping, please review!
We are so close :)

lebedev.ri added a parent revision: D49508: [Sema] Mark implicitly-inserted ICE's as being part of explicit cast (PR38166). Jul 19 2018, 5:05 AM

vsk added inline comments. Jul 19 2018, 12:58 PM

docs/UndefinedBehaviorSanitizer.rst
96	Nitpicks: kind of issues -> issue; promotions -> conversions
132	Could you make this more explicit? It would help to point out that this check does not diagnose lossy implicit integer conversions, but that the new check does. Ditto for the comment in the unsigned-integer-overflow section.
lib/CodeGen/CodeGenFunction.h
464	Why not 0 instead of 8, given that in the common case, this stack is unused?
469	I'm not sure the cost of maintaining an extra vector is worth the benefit of the added assertion. Wouldn't it be cheaper to just store the number of pushed casts? You'd only need one constructor which accepts an ArrayRef<const CastExpr *>.
test/CodeGen/catch-implicit-integer-truncations.c
30	There's no need to check the profile metadata here.
160	nit, aren't these true-negatives because we expect to see no errors?

Rebased on top of yet-again rewritten D49508.
Addressed all @vsk's review notes.

More review notes wanted :)

lebedev.ri added inline comments. Jul 20 2018, 5:35 AM

docs/UndefinedBehaviorSanitizer.rst
132	Is this better?
lib/CodeGen/CodeGenFunction.h
464	No longer relevant.
469	No longer relevant.
test/CodeGen/catch-implicit-integer-truncations.c
30	I was checking it because otherwise `HANDLER_IMPLICIT_CAST` would have over-eagerly consumed `, !prof !3` too. But there is actually a way around that..
160	Right.

vsk added inline comments. Jul 20 2018, 10:34 AM

docs/UndefinedBehaviorSanitizer.rst
132	Looks good.
155–156	Please add "the `implicit-cast` group of checks" to this list.
lib/CodeGen/CodeGenFunction.h
464	I'm referring to CastExprStack within ScalarExprEmitter, which still allocates space for 8 pointers inline.

lebedev.ri added inline comments. Jul 20 2018, 10:41 AM

lib/CodeGen/CodeGenFunction.h
464	Ah, you mean in the general case when the sanitizer is disabled?

vsk added inline comments. Jul 20 2018, 10:57 AM

lib/CodeGen/CodeGenFunction.h
464	Yes. It's a relatively minor concern, but clang's stack can get pretty deep inside of CodeGenFunction. At one point we needed to outline code by hand to unbreak the ASan build. Later I think we just increased the stack size rlimit. I don't see a countervailing performance benefit of allocating more space inline, at least not here.

lebedev.ri added inline comments. Jul 20 2018, 11:27 AM

lib/CodeGen/CodeGenFunction.h
464	No, i agree and totally understand. I just didn't think about that sanitizer-less context.

Address @vsk's review notes.

Rebased on top of svn tip / git master, now that D49508 has landed,
which means there shouldn't be any more false-positives (and it's a bit faster to detect that the check shouldn't be emitted, too).

Ping, any more notes? :)

LGTM, although I think it'd be helpful to have another +1 just to be safe.

I did two small experiments with this using a Stage1 Rel+Asserts compiler:

Stage2 Rel+Asserts build of llvm-tblgen:
ninja llvm-tblgen 384.27s user 14.98s system 1467% cpu 27.203 total

Stage2 Rel+Asserts build of llvm-tblgen with implicit-cast checking:
ninja llvm-tblgen 385.15s user 15.02s system 1472% cpu 27.170 total

With caveats about having a small sample size here and testing with an asserts-enabled stage1 build, I don't see any red flags about the compile-time overhead of the new check. I would have liked to measure the check against more code, but I couldn't complete a stage2 build due to a diagnostic which might plausibly point to a real issue in tblgen:

/Users/vsk/src/llvm.org-lldbsan/llvm/utils/TableGen/RegisterInfoEmitter.cpp:604:17: runtime error: implicit cast from type 'int' of value -1 (32-bit, signed) to type 'const unsigned short' changed the value to 65535 (16-bit, unsigned)

With -fno-sanitize-recover=all disabled, I found a few more reports:

/Users/vsk/src/llvm.org-lldbsan/llvm/include/llvm/Object/Archive.h:278:38: runtime error: implicit cast from type 'int' of value -1 (32-bit, signed) to type 'uint16_t' (aka 'unsigned short') changed the value to 65535 (16-bit, unsigned)
--> uint16_t FirstRegularStartOfFile = -1;

/Users/vsk/src/llvm.org-lldbsan/llvm/lib/Analysis/MemorySSA.cpp:199:12: runtime error: implicit cast from type 'size_t' (aka 'unsigned long') of value 4969132974595412838 (64-bit, unsigned) to type 'unsigned int' changed the value to 3765474150 (32-bit, unsigned)
--> hash_combine() result casted to unsigned

/Users/vsk/src/llvm.org-lldbsan/llvm/lib/CodeGen/TargetLoweringBase.cpp:1212:30: runtime error: implicit cast from type 'unsigned int' of value 512 (32-bit, unsigned) to type 'unsigned char' changed the value to 0 (8-bit, unsigned)
--> NumRegistersForVT[i] = getVectorTypeBreakdownMVT(...)

/Users/vsk/src/llvm.org-lldbsan/llvm/lib/Transforms/Scalar/EarlyCSE.cpp:136:12: runtime error: implicit cast from type 'size_t' (aka 'unsigned long') of value 16583795711468875482 (64-bit, unsigned) to type 'unsigned int' changed the value to 3116347098 (32-bit, unsigned)
--> hash_combine() result casted to unsigned
...

These four at least don't look like false positives:

Maybe we should consider special-casing assignments of "-1" to unsigned values? This seems somewhat idiomatic.
At least a few of these are due to not being explicit about dropping the high bits of hash_combine()'s result. Given that this check is opt-in, that seems like a legitimate diagnostic (lost entropy).
The TargetLoweringBase.cpp diagnostic looks a bit scary.

lib/CodeGen/CGExprScalar.cpp
951	nit, extra parens?

This revision is now accepted and ready to land. Jul 24 2018, 11:47 AM

In D48958#1173860, @vsk wrote:

LGTM, although I think it'd be helpful to have another +1 just to be safe.

Thank you for the review!
It would indeed be great if someone else could take a look, especially since we are so close to the branching point.

In D48958#1173860, @vsk wrote:

...

In D48958#1173860, @vsk wrote:

These four at least don't look like false positives:

Maybe we should consider special-casing assignments of "-1" to unsigned values? This seems somewhat idiomatic.

I personally would use ~0U there.
One more datapoint: the implicit-sign-change will/should still complain about that case.
So personally i'd like to keep it.

At least a few of these are due to not being explicit about dropping the high bits of hash_combine()'s result. Given that this check is opt-in, that seems like a legitimate diagnostic (lost entropy).

The TargetLoweringBase.cpp diagnostic looks a bit scary.

lebedev.ri added inline comments. Jul 25 2018, 2:46 PM

lib/CodeGen/CGExprScalar.cpp
944–968	Based on IRC discussion with @rsmith, it seems this should be just `return !Cast->getIsPartOfExplicitCast();` (and inline it), and no need for the `CastExprStack` and stuff.
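
For contrast with the stack-based scan, here is a toy sketch of the flag-based approach mentioned in this comment. The struct `ImplicitCast`, its field names, and both functions are hypothetical and only model the idea: the flag is set once, when the explicit cast is built, so the emitter later needs neither a stack nor a parent lookup.

```cpp
#include <cassert>

// Toy stand-in for an implicit cast node; the field names are hypothetical.
struct ImplicitCast {
  ImplicitCast *sub = nullptr;      // nested implicit cast, if any
  bool partOfExplicitCast = false;  // set when an explicit cast is built
};

// Called when an explicit cast is built around `operand`: mark every
// implicit cast in the chain directly under it.
inline void markPartOfExplicitCast(ImplicitCast *operand) {
  for (ImplicitCast *c = operand; c != nullptr; c = c->sub)
    c->partOfExplicitCast = true;
}

// The emitter's question becomes a single flag test.
inline bool shouldEmitTruncationCheck(const ImplicitCast &cast) {
  return !cast.partOfExplicitCast;
}
```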

lebedev.ri mentioned this in D49844: [AST] Add a isActuallyImplicitCast() helper to the CastExpr class. Jul 26 2018, 4:14 AM

Address @rsmith & @erichkeane [IRC] review notes:

D49838 - [AST] Sink 'part of explicit cast' down into ImplicitCastExpr
D49844 - [AST] Add a isActuallyImplicitCast() helper to the CastExpr class.
Drop no longer needed CastExprStackGuard, ScalarExprEmitter::IsTopCastPartOfExplictCast(), just use CastExpr::isActuallyImplicitCast() directly.

This should be an NFC change; no functional change is intended.

test/CodeGenCXX/catch-implicit-integer-truncations.cpp
9–34	@rsmith these tests should be equivalent to what you have brought up, so that situation was already tested.

lebedev.ri added a parent revision: D49844: [AST] Add a isActuallyImplicitCast() helper to the CastExpr class. Jul 26 2018, 4:19 AM

1 Nit, otherwise LGTM.

docs/UndefinedBehaviorSanitizer.rst
94	I think the last 2 commas in this sentence are unnecessary?

aaron.ballman added inline comments. Jul 26 2018, 8:16 AM

docs/UndefinedBehaviorSanitizer.rst
92–96	How about: `Implicit cast from a value of integral type which results in data loss where the demoted value, when cast back to the original type, would have a different value than the original. This issue may be caused by an implicit conversion.`
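
The "cast back and compare" wording can be modelled in a few lines. This is only a sketch of the documented rule, not the code clang emits; `truncation_loses_data` is a hypothetical name, and the explicit casts stand in for the implicit conversion being checked.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: converts `value` to the narrower type Dst, converts
// the result back to Src, and reports whether the round trip changed it.
template <typename Dst, typename Src>
constexpr bool truncation_loses_data(Src value) {
  static_assert(sizeof(Dst) < sizeof(Src), "Dst must be narrower than Src");
  return static_cast<Src>(static_cast<Dst>(value)) != value;
}

inline void demo_truncation() {
  // 768 == 0x300: the low 8 bits are all zero, so the value does not survive.
  assert(truncation_loses_data<std::uint8_t>(std::uint32_t{768}));
  // 255 fits in 8 bits, so nothing is lost.
  assert(!truncation_loses_data<std::uint8_t>(std::uint32_t{255}));
  // -1 fits in a signed 8-bit type and sign-extends back to -1.
  assert(!truncation_loses_data<std::int8_t>(std::int32_t{-1}));
}
```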

Small rewording in docs/UndefinedBehaviorSanitizer.rst thanks to @erichkeane & @aaron.ballman!

docs/UndefinedBehaviorSanitizer.rst
92–96	Thank you!

Rebase,
Address @rsmith review notes - just inline D49844.

rsmith added inline comments. Jul 26 2018, 4:30 PM

docs/ReleaseNotes.rst
49–52	Regarding the name of this sanitizer: C and C++ refer to these as "implicit conversions" not "implicit casts", and "implicit cast" is a contradiction in terms -- a cast is explicit syntax for performing a conversion. We should use the external terminology here ("implicit-conversion") rather than the slightly-odd clang-specific convention of calling an implicit conversion an "implicit cast".
304	got -> may have been
306	implicit -> explicit
docs/UndefinedBehaviorSanitizer.rst
17	Don't use Title Caps here. "Problematic implicit conversions"
94	I would parenthesize the "where the demoted value [...] would have a different value from the original" clause, since it's just explaining what we mean by "data loss".
94	Is this really the right rule, though? Consider: `unsigned int x = 0x81234567; int y = x; // does the sanitizer catch this or not?` Here, the value of `x` is not the same as the value of `y` (assuming 32-bit int): `y` is negative. But this is not "data loss" according to the documented meaning of the sanitizer. I think we should produce a sanitizer trap on this case.
130–137	Remove the "Please"s here. We don't need to beg the reader to read the rest of the sentence. Just "Note that this [...]. Also note that integer conversions may result in an unexpected computation result, [...]"
139–145	Likewise here.
include/clang/Basic/Sanitizers.def
139–141	`-fsanitize=integer` should include `-fsanitize=implicit-integer-truncation`.
lib/CodeGen/CGExprScalar.cpp
954–955	Check the Clang types here, not the LLVM types. There is no guarantee that only integer types get converted to LLVM `IntegerType`s. (But if you like, you can assert that `SrcTy` and `DstTy` are `IntegerType`s after checking that the clang types are both integer types.)
956–958	I believe this is redundant: we don't get here for an integer to boolean conversion, and a boolean to integer conversion would always be caught by the bit width check below.
962–964	I think you should also catch casts that change signedness in the case where the sign bit is set on the value. (Though if you want to defer this to a follow-up change and maybe give the sanitizer a different name, that's fine with me.)
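
The `0x81234567` case raised above can be sketched as follows. The helper names are hypothetical, the explicit casts stand in for the implicit conversion, and the sketch assumes that out-of-range unsigned-to-signed conversion wraps (guaranteed since C++20, and the behavior of mainstream compilers before that).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: a same-width round trip (unsigned -> signed ->
// unsigned) always restores the original bits, so a pure "cast back and
// compare" check sees no data loss here.
bool round_trip_differs(std::uint32_t x) {
  std::int32_t y = static_cast<std::int32_t>(x);
  return static_cast<std::uint32_t>(y) != x;
}

// Hypothetical helper: compares the mathematical values by widening both
// sides to 64 bits. They differ exactly when the sign bit of `x` is set.
bool value_changes(std::uint32_t x) {
  std::int32_t y = static_cast<std::int32_t>(x);
  return static_cast<std::int64_t>(y) != static_cast<std::int64_t>(x);
}

inline void demo_sign_change() {
  assert(!round_trip_differs(0x81234567u));  // no bits were lost
  assert(value_changes(0x81234567u));        // but y is negative
  assert(!value_changes(0x01234567u));       // sign bit clear: same value
}
```

This is why the case needs a separate sign-change check rather than a truncation check.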

Hopefully address @rsmith review notes:

s/cast/conversion/ where appropriate
Some wording in docs
Some 'legality' checks in ScalarExprEmitter::EmitIntegerTruncationCheck().

Oops, forgot to submit the inline comments.
(It is inconvenient that they aren't submitted with the rest.)

docs/ReleaseNotes.rst
306	Whoops.
docs/UndefinedBehaviorSanitizer.rst
94	I've reverted this to my original text. It should now convey the correct idea, but I'm not sure this is correct English. Regarding `unsigned int x = 0x81234567; int y = x; // does the sanitizer catch this or not?`: no, it does not, though it indeed should. I do plan on following up with that, which is why I've added the group (`-fsanitize=implicit-conversion`), not just one check.
include/clang/Basic/Sanitizers.def
139–141	Wow. Ok.
lib/CodeGen/CGExprScalar.cpp
954–955	Interesting, ok.
956–958	Uhm, i'll replace it with an assert then.
962–964	Yes, thank you for bringing this up. That is certainly the plan, but I always planned to add that later on.
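
The grouping agreed on for Sanitizers.def (a new `implicit-conversion` group, with `implicit-integer-truncation` also folded into `integer`) can be pictured as plain bitmasks. This is a simplified model only: the bit values and the `enables` helper are hypothetical, and in clang itself `shift` is a group rather than a single check.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical bit assignments, for illustration only.
enum : std::uint32_t {
  SignedIntegerOverflow     = 1u << 0,
  UnsignedIntegerOverflow   = 1u << 1,
  Shift                     = 1u << 2,
  IntegerDivideByZero       = 1u << 3,
  ImplicitIntegerTruncation = 1u << 4,
};

// The new group currently holds a single check.
constexpr std::uint32_t ImplicitConversionGroup = ImplicitIntegerTruncation;

// The integer group, extended with the new check.
constexpr std::uint32_t IntegerGroup =
    ImplicitIntegerTruncation | IntegerDivideByZero | Shift |
    SignedIntegerOverflow | UnsignedIntegerOverflow;

// Hypothetical helper: does `group` turn on every bit in `check`?
constexpr bool enables(std::uint32_t group, std::uint32_t check) {
  return (group & check) == check;
}

inline void demo_groups() {
  assert(enables(IntegerGroup, ImplicitIntegerTruncation));
  assert(enables(ImplicitConversionGroup, ImplicitIntegerTruncation));
  assert(!enables(ImplicitConversionGroup, Shift));
}
```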

Only comments on documentation and assertions. Feel free to commit once these are addressed to your satisfaction.

docs/ReleaseNotes.rst
310	"Just like -fsanitize=integer" -> "Just like other -fsanitize=integer checks", now that this is part of `-fsanitize=integer`.
docs/UndefinedBehaviorSanitizer.rst
17	I don't think it makes sense to list this here, as it's not undefined behavior, and this is a list of undefined behavior that UBSan catches. (And I think it makes sense from a communication perspective to consider the non-UB checks to be "not part of UBSan but handled by the same infrastructure".)
94	bigger -> larger ... would read a bit better.
96	I don't think this last sentence adds anything, and it creates the impression that the issue is sometimes caused by something other than implicit integer conversions (which it isn't, as far as I can tell). Maybe just delete the last sentence here? And instead, something like this would be useful: "Issues caught by this sanitizer are not undefined behavior, but are often unintentional."
134–135	I don't think that's true (not until you add a sanitizer for signed <-> unsigned conversions that change the value). `4U / -2` produces the unexpected result `0U` rather than the mathematically-correct result `-2`, and `-fsanitize=implicit-conversion` doesn't catch it. Maybe just strike this sentence for now? In fact... I think this is too much text to be adding to this bulleted list, which is just supposed to summarize the available checks. Maybe replace the description with Signed integer overflow, where the result of a signed integer computation cannot be represented in its type. This includes all the checks covered by ``-ftrapv``, as well as checks for signed division overflow (``INT_MIN/-1``), but not checks for lossy implicit conversions performed before the computation (see ``-fsanitize=implicit-conversion``).
138–145	And here something like: Unsigned integer overflow, where the result of an unsigned integer computation cannot be represented in its type. Unlike signed integer overflow, this is not undefined behavior, but it is often unintentional. This sanitizer does not check for lossy implicit conversions performed before such a computation (see ``-fsanitize=implicit-conversion``).
161	If we're going to list which sanitizers are enabled here, we should list them all: Enables ``signed-integer-overflow``, ``unsigned-integer-overflow``, ``shift``, ``integer-divide-by-zero``, and ``implicit-integer-truncation``.
lib/CodeGen/CGExprScalar.cpp
960	I think it's generally better for the text in an assertion to describe the violated assumption directly: "clang integer type lowered to non-integer llvm type"
968	I think you should only check `DstType` here. The point of the assert is that there is no such thing as a truncation to `bool` (conversion from integer to `bool` is a comparison against `0`, and if we get here for such a case, there's a bug elsewhere). Other than that, `bool` is a perfectly-normal 1-bit unsigned integer type, and doesn't need to be treated as a special case.
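
The point about `bool` can be illustrated with a short sketch: converting an integer to `bool` is a comparison against zero, not a truncation to one bit. The helper names are hypothetical, and the 8-bit case assumes `CHAR_BIT == 8`.

```cpp
#include <cassert>

// Hypothetical helper: integer -> bool is "value != 0".
bool to_bool(unsigned value) { return value; }

// Hypothetical helper: a real truncation, keeping only the low 8 bits.
unsigned char truncate_to_8_bits(unsigned value) {
  return static_cast<unsigned char>(value);
}

// Hypothetical helper: what a 1-bit truncation would keep.
unsigned keep_low_bit(unsigned value) { return value & 1u; }

inline void demo_bool_conversion() {
  assert(to_bool(256u));                  // 256 != 0, so the result is true
  assert(truncate_to_8_bits(256u) == 0);  // the low 8 bits of 256 are zero
  assert(keep_low_bit(256u) == 0u);       // so is the lowest bit
}
```

If integer-to-`bool` conversion were a truncation, `to_bool(256u)` would be `false`; that mismatch is why reaching the truncation check with a `bool` destination would indicate a bug elsewhere.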

Address last portion of @rsmith review notes.

@rsmith, @vsk, @erichkeane - thank you for the review!

Closed by commit rC338288: [clang][ubsan] Implicit Conversion Sanitizer - integer truncation - clang part (authored by lebedevri). Jul 30 2018, 11:59 AM

This revision was automatically updated to reflect the committed changes.

Diffusion mentioned this in rCRT338287: [compiler-rt][ubsan] Implicit Conversion Sanitizer - integer truncation….

Diffusion mentioned this in rL338287: [compiler-rt][ubsan] Implicit Conversion Sanitizer - integer truncation….

lebedev.ri added inline comments. Jul 30 2018, 11:59 AM

docs/UndefinedBehaviorSanitizer.rst
134–135	I will assume you meant "lossy implicit conversions performed after the computation".

rsmith added inline comments. Jul 30 2018, 1:58 PM

docs/UndefinedBehaviorSanitizer.rst
134–135	I really meant "performed before", for cases like `4u / -2`, where `-2` is implicitly converted to `UINT_MAX - 1` before the computation. Conversions that are performed after a computation aren't part of the computation at all, so I think it's much clearer that they're not in scope for this sanitizer.

lebedev.ri added inline comments. Jul 30 2018, 2:01 PM

docs/UndefinedBehaviorSanitizer.rst
134–135	Ok, with that additional explanation, i do see the error of my ways, and will re-adjust the docs accordingly. Sorry.
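
The `4u / -2` case can be reproduced with a short sketch. The helper names are hypothetical; the conversion shown is the usual arithmetic conversion applied to the right-hand operand before the division runs, which turns `-2` into 2^N - 2, that is, `UINT_MAX - 1`.

```cpp
#include <cassert>
#include <climits>

// Hypothetical helper: makes the int -> unsigned conversion visible.
unsigned as_unsigned(int value) { return static_cast<unsigned>(value); }

// Hypothetical helper: mixed-sign division. `rhs` is implicitly converted
// to unsigned first, so the division is performed on unsigned operands.
unsigned divide_mixed(unsigned lhs, int rhs) { return lhs / rhs; }

inline void demo_mixed_division() {
  assert(as_unsigned(-2) == UINT_MAX - 1);  // -2 wraps to a huge value
  assert(divide_mixed(4u, -2) == 0u);       // 4 / (UINT_MAX - 1) == 0
}
```

The division itself never overflows, so an overflow check on the computation has nothing to report; the surprising result comes entirely from the conversion that precedes it.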

lebedev.ri mentioned this in D50250: [clang][ubsan] Implicit Conversion Sanitizer - integer sign change - clang part. Aug 3 2018, 6:22 AM

lebedev.ri added a child revision: D50250: [clang][ubsan] Implicit Conversion Sanitizer - integer sign change - clang part.

lebedev.ri added a child revision: D50901: [clang][ubsan] Split Implicit Integer Truncation Sanitizer into unsigned and signed checks. Aug 17 2018, 8:03 AM

Diffusion mentioned this in rL345660: [clang][ubsan] Implicit Conversion Sanitizer - integer sign change - clang part. Oct 30 2018, 3:03 PM

Diffusion mentioned this in rC345660: [clang][ubsan] Implicit Conversion Sanitizer - integer sign change - clang part.

Revision Contents

Path	Size
docs/ReleaseNotes.rst	32 lines
docs/UndefinedBehaviorSanitizer.rst	34 lines
include/clang/Basic/Sanitizers.h	3 lines
include/clang/Basic/Sanitizers.def	9 lines
lib/CodeGen/CGExprScalar.cpp	107 lines
lib/CodeGen/CodeGenFunction.h	1 line
lib/Driver/SanitizerArgs.cpp	10 lines
lib/Driver/ToolChain.cpp	4 lines
test/CodeGen/catch-implicit-integer-truncations.c	395 lines
test/CodeGenCXX/catch-implicit-integer-truncations.cpp	256 lines
test/Driver/fsanitize.c	17 lines

Diff 158035

docs/ReleaseNotes.rst

	(40 lines not shown)
	Some of the major new features and improvements to Clang are listed			Some of the major new features and improvements to Clang are listed
	here. Generic improvements to Clang as a whole or to its underlying			here. Generic improvements to Clang as a whole or to its underlying
	infrastructure are described first, followed by language-specific			infrastructure are described first, followed by language-specific
	sections with improvements to Clang's support for those languages.			sections with improvements to Clang's support for those languages.

	Major New Features			Major New Features
	------------------			------------------

	- ...			- A new Implicit Conversion Sanitizer (``-fsanitize=implicit-conversion``) group
				was added. Please refer to the :ref:`release-notes-ubsan` section of the
				release notes for the details.

	Improvements to Clang's diagnostics			Improvements to Clang's diagnostics
	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

	- ``-Wc++98-compat-extra-semi`` is a new flag, which was previously inseparable			- ``-Wc++98-compat-extra-semi`` is a new flag, which was previously inseparable
	from ``-Wc++98-compat-pedantic``. The latter still controls the new flag.			from ``-Wc++98-compat-pedantic``. The latter still controls the new flag.

	- ``-Wextra-semi`` now also controls ``-Wc++98-compat-extra-semi``.			- ``-Wextra-semi`` now also controls ``-Wc++98-compat-extra-semi``.
	Please do note that if you pass ``-Wno-c++98-compat-pedantic``, it implies			Please do note that if you pass ``-Wno-c++98-compat-pedantic``, it implies
	(216 lines not shown)

	Static Analyzer			Static Analyzer
	---------------			---------------

	- ...			- ...

	...			...

				.. _release-notes-ubsan:

	Undefined Behavior Sanitizer (UBSan)			Undefined Behavior Sanitizer (UBSan)
	------------------------------------			------------------------------------

	* ...			* A new Implicit Conversion Sanitizer (``-fsanitize=implicit-conversion``) group
				was added.

				Currently, only one type of issues is caught - implicit integer truncation
				(``-fsanitize=implicit-integer-truncation``), also known as integer demotion.
				While there is a ``-Wconversion`` diagnostic group that catches this kind of
				issues, it is both noisy, and does not catch all the cases.

				.. code-block:: c++

				unsigned char store = 0;

				bool consume(unsigned int val);

				void test(unsigned long val) {
				if (consume(val)) // the value may have been silently truncated.
				store = store + 768; // before addition, 'store' was promoted to int.
				(void)consume((unsigned int)val); // OK, the truncation is explicit.
				}

				Just like other ``-fsanitize=integer`` checks, these issues are not
				undefined behaviour. But they are not always intentional, and are somewhat
				vsk: Could you mention whether the group is enabled by -fsanitize=undefined?
				hard to track down. This group is not enabled by ``-fsanitize=undefined``,
				but the ``-fsanitize=implicit-integer-truncation`` check
				is enabled by ``-fsanitize=integer``.

	Core Analysis Improvements			Core Analysis Improvements
	==========================			==========================

	- ...			- ...

	New Issues Found			New Issues Found
	================			================
	(26 lines not shown)

docs/UndefinedBehaviorSanitizer.rst

==========================		==========================
UndefinedBehaviorSanitizer		UndefinedBehaviorSanitizer
==========================		==========================

.. contents::		.. contents::
:local:		:local:

Introduction		Introduction
============		============

UndefinedBehaviorSanitizer (UBSan) is a fast undefined behavior detector.		UndefinedBehaviorSanitizer (UBSan) is a fast undefined behavior detector.
UBSan modifies the program at compile-time to catch various kinds of undefined		UBSan modifies the program at compile-time to catch various kinds of undefined
behavior during program execution, for example:		behavior during program execution, for example:

* Using misaligned or null pointer		* Using misaligned or null pointer
* Signed integer overflow		* Signed integer overflow
* Conversion to, from, or between floating-point types which would		* Conversion to, from, or between floating-point types which would
overflow the destination		overflow the destination

See the full list of available :ref:`checks <ubsan-checks>` below.		See the full list of available :ref:`checks <ubsan-checks>` below.

UBSan has an optional run-time library which provides better error reporting.		UBSan has an optional run-time library which provides better error reporting.
The checks have small runtime cost and no impact on address space layout or ABI.		The checks have small runtime cost and no impact on address space layout or ABI.

How to build		How to build
(58 lines not shown)
Available checks are:
- ``-fsanitize=float-cast-overflow``: Conversion to, from, or		- ``-fsanitize=float-cast-overflow``: Conversion to, from, or
between floating-point types which would overflow the		between floating-point types which would overflow the
destination.		destination.
- ``-fsanitize=float-divide-by-zero``: Floating point division by		- ``-fsanitize=float-divide-by-zero``: Floating point division by
zero.		zero.
- ``-fsanitize=function``: Indirect call of a function through a		- ``-fsanitize=function``: Indirect call of a function through a
function pointer of the wrong type (Darwin/Linux, C++ and x86/x86_64		function pointer of the wrong type (Darwin/Linux, C++ and x86/x86_64
only).		only).
		- ``-fsanitize=implicit-integer-truncation``: Implicit conversion from
		integer of larger bit width to smaller bit width, if that results in data
		loss. That is, if the demoted value, after casting back to the original
		width, is not equal to the original value before the downcast.
		Issues caught by this sanitizer are not undefined behavior,
		vsk: Nitpicks: "kind of issues" -> "issue"; "promotions" -> "conversions".
		but are often unintentional.
- ``-fsanitize=integer-divide-by-zero``: Integer division by zero.		- ``-fsanitize=integer-divide-by-zero``: Integer division by zero.
- ``-fsanitize=nonnull-attribute``: Passing null pointer as a function		- ``-fsanitize=nonnull-attribute``: Passing null pointer as a function
parameter which is declared to never be null.		parameter which is declared to never be null.
- ``-fsanitize=null``: Use of a null pointer or creation of a null		- ``-fsanitize=null``: Use of a null pointer or creation of a null
reference.		reference.
- ``-fsanitize=nullability-arg``: Passing null as a function parameter		- ``-fsanitize=nullability-arg``: Passing null as a function parameter
which is annotated with ``_Nonnull``.		which is annotated with ``_Nonnull``.
- ``-fsanitize=nullability-assign``: Assigning null to an lvalue which		- ``-fsanitize=nullability-assign``: Assigning null to an lvalue which
(16 lines not shown)
- ``-fsanitize=returns-nonnull-attribute``: Returning null pointer
from a function which is declared to never return null.		from a function which is declared to never return null.
- ``-fsanitize=shift``: Shift operators where the amount shifted is		- ``-fsanitize=shift``: Shift operators where the amount shifted is
greater or equal to the promoted bit-width of the left hand side		greater or equal to the promoted bit-width of the left hand side
or less than zero, or where the left hand side is negative. For a		or less than zero, or where the left hand side is negative. For a
signed left shift, also checks for signed overflow in C, and for		signed left shift, also checks for signed overflow in C, and for
unsigned overflow in C++. You can use ``-fsanitize=shift-base`` or		unsigned overflow in C++. You can use ``-fsanitize=shift-base`` or
``-fsanitize=shift-exponent`` to check only left-hand side or		``-fsanitize=shift-exponent`` to check only left-hand side or
right-hand side of shift operation, respectively.		right-hand side of shift operation, respectively.
- ``-fsanitize=signed-integer-overflow``: Signed integer overflow,		- ``-fsanitize=signed-integer-overflow``: Signed integer overflow, where the
including all the checks added by ``-ftrapv``, and checking for		result of a signed integer computation cannot be represented in its type.
overflow in signed division (``INT_MIN / -1``).		This includes all the checks covered by ``-ftrapv``, as well as checks for
		vsk: Could you make this more explicit? It would help to point out that this check does not diagnose lossy implicit integer conversions, but that the new check does. Ditto for the comment in the unsigned-integer-overflow section.
		lebedev.ri: Is this better?
		vsk: Looks good.
		signed division overflow (``INT_MIN/-1``), but not checks for
		lossy implicit conversions performed after the computation
		(see ``-fsanitize=implicit-conversion``). Both of these two issues are
		handled by ``-fsanitize=implicit-conversion`` group of checks.
- ``-fsanitize=unreachable``: If control flow reaches an unreachable		- ``-fsanitize=unreachable``: If control flow reaches an unreachable
program point.		program point.
- ``-fsanitize=unsigned-integer-overflow``: Unsigned integer		- ``-fsanitize=unsigned-integer-overflow``: Unsigned integer overflow, where
overflows. Note that unlike signed integer overflow, unsigned integer		the result of an unsigned integer computation cannot be represented in its
is not undefined behavior. However, while it has well-defined semantics,		type. Unlike signed integer overflow, this is not undefined behavior, but
it is often unintentional, so UBSan offers to catch it.		it is often unintentional. This sanitizer does not check for lossy implicit
		conversions performed after such a computation
		(see ``-fsanitize=implicit-conversion``).
- ``-fsanitize=vla-bound``: A variable-length array whose bound		- ``-fsanitize=vla-bound``: A variable-length array whose bound
does not evaluate to a positive value.		does not evaluate to a positive value.
- ``-fsanitize=vptr``: Use of an object whose vptr indicates that it is of		- ``-fsanitize=vptr``: Use of an object whose vptr indicates that it is of
the wrong dynamic type, or that its lifetime has not begun or has ended.		the wrong dynamic type, or that its lifetime has not begun or has ended.
Incompatible with ``-fno-rtti``. Link must be performed by ``clang++``, not		Incompatible with ``-fno-rtti``. Link must be performed by ``clang++``, not
``clang``, to make sure C++-specific parts of the runtime library and C++		``clang``, to make sure C++-specific parts of the runtime library and C++
standard libraries are present.		standard libraries are present.

You can also use the following check groups:		You can also use the following check groups:
- ``-fsanitize=undefined``: All of the checks listed above other than		- ``-fsanitize=undefined``: All of the checks listed above other than
``unsigned-integer-overflow`` and the ``nullability-*`` checks.		``unsigned-integer-overflow``, ``implicit-conversion`` and the
		``nullability-*`` group of checks.
		vsk: Please add "the `implicit-cast` group of checks" to this list.
- ``-fsanitize=undefined-trap``: Deprecated alias of		- ``-fsanitize=undefined-trap``: Deprecated alias of
``-fsanitize=undefined``.		``-fsanitize=undefined``.
- ``-fsanitize=integer``: Checks for undefined or suspicious integer		- ``-fsanitize=integer``: Checks for undefined or suspicious integer
behavior (e.g. unsigned integer overflow).		behavior (e.g. unsigned integer overflow).
		Enables ``signed-integer-overflow``, ``unsigned-integer-overflow``,
		``shift``, ``integer-divide-by-zero``, and ``implicit-integer-truncation``.
		- ``-fsanitize=implicit-conversion``: Checks for suspicious behaviours of
		implicit conversions.
		Currently, only ``-fsanitize=implicit-integer-truncation`` is implemented.
- ``-fsanitize=nullability``: Enables ``nullability-arg``,		- ``-fsanitize=nullability``: Enables ``nullability-arg``,
``nullability-assign``, and ``nullability-return``. While violating		``nullability-assign``, and ``nullability-return``. While violating
nullability does not have undefined behavior, it is often unintentional,		nullability does not have undefined behavior, it is often unintentional,
so UBSan offers to catch it.		so UBSan offers to catch it.

Volatile		Volatile
--------		--------

(140 lines not shown)

include/clang/Basic/Sanitizers.h

	(78 lines not shown)

	/// For each sanitizer group bit set in \p Kinds, set the bits for sanitizers			/// For each sanitizer group bit set in \p Kinds, set the bits for sanitizers
	/// this group enables.			/// this group enables.
	SanitizerMask expandSanitizerGroups(SanitizerMask Kinds);			SanitizerMask expandSanitizerGroups(SanitizerMask Kinds);

	/// Return the sanitizers which do not affect preprocessing.			/// Return the sanitizers which do not affect preprocessing.
	inline SanitizerMask getPPTransparentSanitizers() {			inline SanitizerMask getPPTransparentSanitizers() {
	return SanitizerKind::CFI \| SanitizerKind::Integer \|			return SanitizerKind::CFI \| SanitizerKind::Integer \|
	SanitizerKind::Nullability \| SanitizerKind::Undefined;			SanitizerKind::ImplicitConversion \| SanitizerKind::Nullability \|
				SanitizerKind::Undefined;
	}			}

	} // namespace clang			} // namespace clang

	#endif // LLVM_CLANG_BASIC_SANITIZERS_H			#endif // LLVM_CLANG_BASIC_SANITIZERS_H

include/clang/Basic/Sanitizers.def

Show First 20 Lines • Show All 125 Lines • ▼ Show 20 Lines	SANITIZER_GROUP("undefined", Undefined,
IntegerDivideByZero | NonnullAttribute | Null | ObjectSize |		IntegerDivideByZero | NonnullAttribute | Null | ObjectSize |
PointerOverflow | Return | ReturnsNonnullAttribute | Shift |		PointerOverflow | Return | ReturnsNonnullAttribute | Shift |
SignedIntegerOverflow | Unreachable | VLABound | Function |		SignedIntegerOverflow | Unreachable | VLABound | Function |
Vptr)		Vptr)

// -fsanitize=undefined-trap is an alias for -fsanitize=undefined.		// -fsanitize=undefined-trap is an alias for -fsanitize=undefined.
SANITIZER_GROUP("undefined-trap", UndefinedTrap, Undefined)		SANITIZER_GROUP("undefined-trap", UndefinedTrap, Undefined)

		// ImplicitConversionSanitizer
		SANITIZER("implicit-integer-truncation", ImplicitIntegerTruncation)
		SANITIZER_GROUP("implicit-conversion", ImplicitConversion,
		ImplicitIntegerTruncation)

SANITIZER_GROUP("integer", Integer,		SANITIZER_GROUP("integer", Integer,
SignedIntegerOverflow | UnsignedIntegerOverflow | Shift |		ImplicitIntegerTruncation | IntegerDivideByZero | Shift |
IntegerDivideByZero)		SignedIntegerOverflow | UnsignedIntegerOverflow)
		rsmith: `-fsanitize=integer` should include `-fsanitize=implicit-integer-truncation`.
		lebedev.ri: Wow. Ok.

SANITIZER("local-bounds", LocalBounds)		SANITIZER("local-bounds", LocalBounds)
SANITIZER_GROUP("bounds", Bounds, ArrayBounds | LocalBounds)		SANITIZER_GROUP("bounds", Bounds, ArrayBounds | LocalBounds)

// EfficiencySanitizer		// EfficiencySanitizer
SANITIZER("efficiency-cache-frag", EfficiencyCacheFrag)		SANITIZER("efficiency-cache-frag", EfficiencyCacheFrag)
SANITIZER("efficiency-working-set", EfficiencyWorkingSet)		SANITIZER("efficiency-working-set", EfficiencyWorkingSet)
// Meta-group only used internally.		// Meta-group only used internally.

lib/CodeGen/CGExprScalar.cpp

class ScalarExprEmitter		class ScalarExprEmitter
: public StmtVisitor<ScalarExprEmitter, Value*> {		: public StmtVisitor<ScalarExprEmitter, Value*> {
CodeGenFunction &CGF;		CodeGenFunction &CGF;
CGBuilderTy &Builder;		CGBuilderTy &Builder;
bool IgnoreResultAssign;		bool IgnoreResultAssign;
llvm::LLVMContext &VMContext;		llvm::LLVMContext &VMContext;
public:		public:

		vsk: It would help to have this comment explain that the stack is used/maintained exclusively by the implicit cast sanitizer.
ScalarExprEmitter(CodeGenFunction &cgf, bool ira=false)		ScalarExprEmitter(CodeGenFunction &cgf, bool ira=false)
: CGF(cgf), Builder(CGF.Builder), IgnoreResultAssign(ira),		: CGF(cgf), Builder(CGF.Builder), IgnoreResultAssign(ira),
VMContext(cgf.getLLVMContext()) {		VMContext(cgf.getLLVMContext()) {
}		}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
		vsk: Could you make this comment more specific -- maybe by explaining that for efficiency reasons, the cast expr stack is only maintained when a sanitizer check is enabled?
// Utilities		// Utilities
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

bool TestAndClearIgnoreResultAssign() {		bool TestAndClearIgnoreResultAssign() {
bool I = IgnoreResultAssign;		bool I = IgnoreResultAssign;
IgnoreResultAssign = false;		IgnoreResultAssign = false;
		vsk: I think if you were to use references instead of pointers here, the code would be a bit clearer, and you wouldn't need to assert that CE is non-null.
return I;		return I;
}		}

llvm::Type *ConvertType(QualType T) { return CGF.ConvertType(T); }		llvm::Type *ConvertType(QualType T) { return CGF.ConvertType(T); }
LValue EmitLValue(const Expr *E) { return CGF.EmitLValue(E); }		LValue EmitLValue(const Expr *E) { return CGF.EmitLValue(E); }
LValue EmitCheckedLValue(const Expr *E, CodeGenFunction::TypeCheckKind TCK) {		LValue EmitCheckedLValue(const Expr *E, CodeGenFunction::TypeCheckKind TCK) {
return CGF.EmitCheckedLValue(E, TCK);		return CGF.EmitCheckedLValue(E, TCK);
}		}
▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines	public:
Value *EmitConversionToBool(Value *Src, QualType DstTy);		Value *EmitConversionToBool(Value *Src, QualType DstTy);

/// Emit a check that a conversion to or from a floating-point type does not		/// Emit a check that a conversion to or from a floating-point type does not
/// overflow.		/// overflow.
void EmitFloatConversionCheck(Value *OrigSrc, QualType OrigSrcType,		void EmitFloatConversionCheck(Value *OrigSrc, QualType OrigSrcType,
Value *Src, QualType SrcType, QualType DstType,		Value *Src, QualType SrcType, QualType DstType,
llvm::Type *DstTy, SourceLocation Loc);		llvm::Type *DstTy, SourceLocation Loc);

		/// Known implicit conversion check kinds.
		/// Keep in sync with the enum of the same name in ubsan_handlers.h
		enum ImplicitConversionCheckKind : unsigned char {
		ICCK_IntegerTruncation = 0,
		};

		/// Emit a check that an [implicit] truncation of an integer does not
		/// discard any bits. It is not UB, so we use the value after truncation.
		void EmitIntegerTruncationCheck(Value *Src, QualType SrcType, Value *Dst,
		QualType DstType, SourceLocation Loc);

/// Emit a conversion from the specified type to the specified destination		/// Emit a conversion from the specified type to the specified destination
/// type, both of which are LLVM scalar types.		/// type, both of which are LLVM scalar types.
Value *EmitScalarConversion(Value *Src, QualType SrcTy, QualType DstTy,		struct ScalarConversionOpts {
		vsk: I think the number of overloads here is really unwieldy. There should be a simpler way to structure this. What about consolidating all four overloads into one? Maybe: `struct ScalarConversionsOpts { bool TreatBoolAsUnsigned = false; bool EmitImplicitIntegerTruncationCheck = false; };` and `Value *EmitScalarConversion(Src, SrcTy, DstTy, Loc, Opts = ScalarConversionOpts())`. It's not necessary to pass CastExpr in, right? There's only one place where that's done. It seems simpler to just do the SanOpts / isCastPartOfExplicitCast checking there.
		lebedev.ri: The number of overloads is indeed unwieldy.
SourceLocation Loc);		bool TreatBooleanAsSigned;
		vsk: Why not use default member initializers here (e.g, "bool a = false")?
		lebedev.ri: I'll double-check, but i'm pretty sure there were some warnings when i did that. Or the default needs to be defined in the actual declaration of `EmitScalarConversion()`, i think.
		lebedev.ri:
		```
		[2/14 0.3/sec] Building CXX object tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o
		FAILED: tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o
		/usr/bin/clang++-6.0 -DGTEST_HAS_RTTI=0 -D_DEBUG -D_GNU_SOURCE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -Itools/clang/lib/CodeGen -I/build/clang/lib/CodeGen -I/build/clang/include -Itools/clang/include -I/usr/include/libxml2 -Iinclude -I/build/llvm/include -g0 -fPIC -fvisibility-inlines-hidden -Werror -Werror=date-time -Werror=unguarded-availability-new -std=c++11 -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wcovered-switch-default -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wstring-conversion -fdiagnostics-color -ffunction-sections -fdata-sections -fno-common -Woverloaded-virtual -Wno-nested-anon-types -O3 -g0 -fPIC -UNDEBUG -fno-exceptions -fno-rtti -MD -MT tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o -MF tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o.d -o tools/clang/lib/CodeGen/CMakeFiles/clangCodeGen.dir/CGExprScalar.cpp.o -c /build/clang/lib/CodeGen/CGExprScalar.cpp
		/build/clang/lib/CodeGen/CGExprScalar.cpp:355:52: error: default member initializer for 'TreatBooleanAsSigned' needed within definition of enclosing class 'ScalarExprEmitter' outside of member functions
		ScalarConversionOpts Opts = ScalarConversionOpts());
		^
		/build/clang/lib/CodeGen/CGExprScalar.cpp:349:10: note: default member initializer declared here
		bool TreatBooleanAsSigned = false;
		^
		/build/clang/lib/CodeGen/CGExprScalar.cpp:355:52: error: default member initializer for 'EmitImplicitIntegerTruncationChecks' needed within definition of enclosing class 'ScalarExprEmitter' outside of member functions
		ScalarConversionOpts Opts = ScalarConversionOpts());
		^
		/build/clang/lib/CodeGen/CGExprScalar.cpp:350:10: note: default member initializer declared here
		bool EmitImplicitIntegerTruncationChecks = false;
		^
		2 errors generated.
		```
		bool EmitImplicitIntegerTruncationChecks;
Value *EmitScalarConversion(Value *Src, QualType SrcTy, QualType DstTy,
SourceLocation Loc, bool TreatBooleanAsSigned);		ScalarConversionOpts()
		: TreatBooleanAsSigned(false),
		EmitImplicitIntegerTruncationChecks(false) {}
		};
		Value *
		EmitScalarConversion(Value *Src, QualType SrcTy, QualType DstTy,
		SourceLocation Loc,
		ScalarConversionOpts Opts = ScalarConversionOpts());
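
The constructor-based workaround in the struct above can be reproduced in isolation. The following is only a minimal sketch, not code from the patch: `Emitter` is a hypothetical stand-in for `ScalarExprEmitter`, `ConversionOpts`, `wantsTruncationCheck` and `demo` are illustrative names, and `constexpr` is added purely so that the behaviour can be checked at compile time (C++14 or later is assumed).

```cpp
// Hypothetical stand-in for ScalarExprEmitter; only the options struct matters.
class Emitter {
public:
  struct ConversionOpts {
    bool TreatBooleanAsSigned;
    bool EmitImplicitIntegerTruncationChecks;

    // An explicit constructor is used instead of default member initializers
    // ("bool x = false;"). With default member initializers, the default
    // argument below is rejected inside the enclosing class, as the build
    // log in the review shows.
    constexpr ConversionOpts()
        : TreatBooleanAsSigned(false),
          EmitImplicitIntegerTruncationChecks(false) {}
  };

  // The default argument constructs the nested struct while the enclosing
  // class is still being defined.
  constexpr bool wantsTruncationCheck(ConversionOpts Opts = ConversionOpts()) const {
    return Opts.EmitImplicitIntegerTruncationChecks;
  }
};

// Callers opt in by setting only the fields they care about.
constexpr bool demo() {
  Emitter::ConversionOpts Opts;
  Opts.EmitImplicitIntegerTruncationChecks = true;
  return Emitter().wantsTruncationCheck(Opts);
}

static_assert(!Emitter().wantsTruncationCheck(), "checks are off by default");
static_assert(demo(), "checks are on when the caller opts in");
```

Compared with a chain of overloads, a single options struct lets each call site name the one flag it changes, which is the shape the `CK_IntegralCast` and `CK_BooleanToSignedIntegral` cases in this diff rely on.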

/// Emit a conversion from the specified complex type to the specified		/// Emit a conversion from the specified complex type to the specified
/// destination type, where the destination type is an LLVM scalar type.		/// destination type, where the destination type is an LLVM scalar type.
Value *EmitComplexToScalarConversion(CodeGenFunction::ComplexPairTy Src,		Value *EmitComplexToScalarConversion(CodeGenFunction::ComplexPairTy Src,
QualType SrcTy, QualType DstTy,		QualType SrcTy, QualType DstTy,
SourceLocation Loc);		SourceLocation Loc);

/// EmitNullValue - Emit a value that corresponds to null for the given type.		/// EmitNullValue - Emit a value that corresponds to null for the given type.
▲ Show 20 Lines • Show All 601 Lines • ▼ Show 20 Lines	void ScalarExprEmitter::EmitFloatConversionCheck(

llvm::Constant *StaticArgs[] = {CGF.EmitCheckSourceLocation(Loc),		llvm::Constant *StaticArgs[] = {CGF.EmitCheckSourceLocation(Loc),
CGF.EmitCheckTypeDescriptor(OrigSrcType),		CGF.EmitCheckTypeDescriptor(OrigSrcType),
CGF.EmitCheckTypeDescriptor(DstType)};		CGF.EmitCheckTypeDescriptor(DstType)};
CGF.EmitCheck(std::make_pair(Check, SanitizerKind::FloatCastOverflow),		CGF.EmitCheck(std::make_pair(Check, SanitizerKind::FloatCastOverflow),
SanitizerHandler::FloatCastOverflow, StaticArgs, OrigSrc);		SanitizerHandler::FloatCastOverflow, StaticArgs, OrigSrc);
}		}

/// Emit a conversion from the specified type to the specified destination type,		void ScalarExprEmitter::EmitIntegerTruncationCheck(Value *Src, QualType SrcType,
/// both of which are LLVM scalar types.		Value *Dst, QualType DstType,
Value *ScalarExprEmitter::EmitScalarConversion(Value *Src, QualType SrcType,
QualType DstType,
SourceLocation Loc) {		SourceLocation Loc) {
return EmitScalarConversion(Src, SrcType, DstType, Loc, false);		if (!CGF.SanOpts.has(SanitizerKind::ImplicitIntegerTruncation))
		return;

		llvm::Type *SrcTy = Src->getType();
		llvm::Type *DstTy = Dst->getType();
		vsk: nit, function names typically begin with a verb: 'isCastPartOf...'
		vsk: nit, extra parens?

		// We only care about int->int conversions here.
		// We ignore conversions to/from pointer and/or bool.
		if (!(SrcType->isIntegerType() && DstType->isIntegerType()))
		rsmith: Check the Clang types here, not the LLVM types. There is no guarantee that only integer types get converted to LLVM `IntegerType`s. (But if you like, you can assert that `SrcTy` and `DstTy` are `IntegerType`s after checking that the clang types are both integer types.)
		lebedev.ri: Interesting, ok.
		return;

		vsk: The none_of call could safely be replaced by `Cast->getSubExpr() != PreviousCast`, I think.
		assert(isa<llvm::IntegerType>(SrcTy) && isa<llvm::IntegerType>(DstTy) &&
		rsmith: I believe this is redundant: we don't get here for an integer to boolean conversion, and a boolean to integer conversion would always be caught by the bit width check below.
		lebedev.ri: Uhm, i'll replace it with an assert then.
		"clang integer type lowered to non-integer llvm type");

		rsmith: I think it's generally better for the text in an assertion to describe the violated assumption directly: "clang integer type lowered to non-integer llvm type"
		unsigned SrcBits = SrcTy->getScalarSizeInBits();
		unsigned DstBits = DstTy->getScalarSizeInBits();
		// This must be truncation. Else we do not care.
		if (SrcBits <= DstBits)
		rsmith: I think you should also catch casts that change signedness in the case where the sign bit is set on the value. (Though if you want to defer this to a follow-up change and maybe give the sanitizer a different name, that's fine with me.)
		lebedev.ri: Yes, thank you for bringing this up. That is certainly the plan, but i always planned to add that later on.
		return;

		assert(!DstType->isBooleanType() && "we should not get here with booleans.");

		lebedev.ri: Based on IRC discussion with @rsmith, it seems this should be just `return !Cast->getIsPartOfExplicitCast();` (and inline it), and no need for the `CastExprStack` and stuff.
		rsmith: I think you should only check `DstType` here. The point of the assert is that there is no such thing as a truncation to `bool` (conversion from integer to `bool` is a comparison against `0`, and if we get here for such a case, there's a bug elsewhere). Other than that, `bool` is a perfectly-normal 1-bit unsigned integer type, and doesn't need to be treated as a special case.
		CodeGenFunction::SanitizerScope SanScope(&CGF);

		llvm::Value *Check = nullptr;

		// 1. Extend the truncated value back to the same width as the Src.
		bool InputSigned = DstType->isSignedIntegerOrEnumerationType();
		Check = Builder.CreateIntCast(Dst, SrcTy, InputSigned, "anyext");
		// 2. Equality-compare with the original source value
		Check = Builder.CreateICmpEQ(Check, Src, "truncheck");
		// If the comparison result is 'i1 false', then the truncation was lossy.

		llvm::Constant *StaticArgs[] = {
		CGF.EmitCheckSourceLocation(Loc), CGF.EmitCheckTypeDescriptor(SrcType),
		CGF.EmitCheckTypeDescriptor(DstType),
		llvm::ConstantInt::get(Builder.getInt8Ty(), ICCK_IntegerTruncation)};
		CGF.EmitCheck(std::make_pair(Check, SanitizerKind::ImplicitIntegerTruncation),
		SanitizerHandler::ImplicitConversion, StaticArgs, {Src, Dst});
}		}
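
The two numbered steps in `EmitIntegerTruncationCheck` can be mimicked in plain C++ for one fixed pair of widths. This is only an illustrative sketch under stated assumptions, not what the patch emits: `truncationIsLossless` is a hypothetical helper, the widths are fixed at 32 and 8 bits, `DstSigned` plays the role of `DstType->isSignedIntegerOrEnumerationType()`, and the conversion to `int8_t` is assumed to wrap (it is implementation-defined before C++20, but wraps on mainstream compilers).

```cpp
#include <cstdint>

// Hypothetical run-time analogue of the emitted check for an
// int32_t -> 8-bit integer conversion. Returns true when no bits are lost.
constexpr bool truncationIsLossless(std::int32_t Src, bool DstSigned) {
  // The conversion itself: keep only the low 8 bits of the source.
  std::uint8_t Dst = static_cast<std::uint8_t>(Src);
  // 1. Extend the truncated value back to the width of Src, sign- or
  //    zero-extending according to the signedness of the destination type.
  std::int32_t Ext =
      DstSigned ? static_cast<std::int32_t>(static_cast<std::int8_t>(Dst))
                : static_cast<std::int32_t>(Dst);
  // 2. Equality-compare with the original source value.
  //    A mismatch means the truncation was lossy.
  return Ext == Src;
}

// 768 does not fit in an unsigned char (the example from the summary).
static_assert(!truncationIsLossless(768, /*DstSigned=*/false), "lossy");
// 255 fits in an unsigned 8-bit type but not in a signed one.
static_assert(truncationIsLossless(255, /*DstSigned=*/false), "lossless");
static_assert(!truncationIsLossless(255, /*DstSigned=*/true), "lossy");
// -1 round-trips through a signed 8-bit type but not an unsigned one.
static_assert(truncationIsLossless(-1, /*DstSigned=*/true), "lossless");
static_assert(!truncationIsLossless(-1, /*DstSigned=*/false), "lossy");
```

Extending with the destination's signedness is what makes the round trip meaningful: the comparison asks whether the truncated value, read as the destination type, still denotes the same number as the source.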

		/// Emit a conversion from the specified type to the specified destination type,
		/// both of which are LLVM scalar types.
Value *ScalarExprEmitter::EmitScalarConversion(Value *Src, QualType SrcType,		Value *ScalarExprEmitter::EmitScalarConversion(Value *Src, QualType SrcType,
QualType DstType,		QualType DstType,
SourceLocation Loc,		SourceLocation Loc,
bool TreatBooleanAsSigned) {		ScalarConversionOpts Opts) {
		QualType NoncanonicalSrcType = SrcType;
		QualType NoncanonicalDstType = DstType;

SrcType = CGF.getContext().getCanonicalType(SrcType);		SrcType = CGF.getContext().getCanonicalType(SrcType);
DstType = CGF.getContext().getCanonicalType(DstType);		DstType = CGF.getContext().getCanonicalType(DstType);
if (SrcType == DstType) return Src;		if (SrcType == DstType) return Src;

if (DstType->isVoidType()) return nullptr;		if (DstType->isVoidType()) return nullptr;

llvm::Value *OrigSrc = Src;		llvm::Value *OrigSrc = Src;
QualType OrigSrcType = SrcType;		QualType OrigSrcType = SrcType;
▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines	if (SrcTy->isFloatingPointTy()) {
// If the half type is supported, just use an fptrunc.		// If the half type is supported, just use an fptrunc.
return Builder.CreateFPTrunc(Src, DstTy);		return Builder.CreateFPTrunc(Src, DstTy);
}		}
DstTy = CGF.FloatTy;		DstTy = CGF.FloatTy;
}		}

if (isa<llvm::IntegerType>(SrcTy)) {		if (isa<llvm::IntegerType>(SrcTy)) {
bool InputSigned = SrcType->isSignedIntegerOrEnumerationType();		bool InputSigned = SrcType->isSignedIntegerOrEnumerationType();
if (SrcType->isBooleanType() && TreatBooleanAsSigned) {		if (SrcType->isBooleanType() && Opts.TreatBooleanAsSigned) {
InputSigned = true;		InputSigned = true;
}		}
if (isa<llvm::IntegerType>(DstTy))		if (isa<llvm::IntegerType>(DstTy))
Res = Builder.CreateIntCast(Src, DstTy, InputSigned, "conv");		Res = Builder.CreateIntCast(Src, DstTy, InputSigned, "conv");
else if (InputSigned)		else if (InputSigned)
Res = Builder.CreateSIToFP(Src, DstTy, "conv");		Res = Builder.CreateSIToFP(Src, DstTy, "conv");
else		else
Res = Builder.CreateUIToFP(Src, DstTy, "conv");		Res = Builder.CreateUIToFP(Src, DstTy, "conv");
Show All 18 Lines	if (CGF.getContext().getTargetInfo().useFP16ConversionIntrinsics()) {
Res = Builder.CreateCall(		Res = Builder.CreateCall(
CGF.CGM.getIntrinsic(llvm::Intrinsic::convert_to_fp16, CGF.CGM.FloatTy),		CGF.CGM.getIntrinsic(llvm::Intrinsic::convert_to_fp16, CGF.CGM.FloatTy),
Res);		Res);
} else {		} else {
Res = Builder.CreateFPTrunc(Res, ResTy, "conv");		Res = Builder.CreateFPTrunc(Res, ResTy, "conv");
}		}
}		}

		if (Opts.EmitImplicitIntegerTruncationChecks)
		EmitIntegerTruncationCheck(Src, NoncanonicalSrcType, Res,
		NoncanonicalDstType, Loc);

return Res;		return Res;
}		}

/// Emit a conversion from the specified complex type to the specified		/// Emit a conversion from the specified complex type to the specified
/// destination type, where the destination type is an LLVM scalar type.		/// destination type, where the destination type is an LLVM scalar type.
Value *ScalarExprEmitter::EmitComplexToScalarConversion(		Value *ScalarExprEmitter::EmitComplexToScalarConversion(
CodeGenFunction::ComplexPairTy Src, QualType SrcTy, QualType DstTy,		CodeGenFunction::ComplexPairTy Src, QualType SrcTy, QualType DstTy,
SourceLocation Loc) {		SourceLocation Loc) {
▲ Show 20 Lines • Show All 441 Lines • ▼ Show 20 Lines	bool CodeGenFunction::ShouldNullCheckClassCastValue(const CastExpr *CE) {
}		}

return true;		return true;
}		}

// VisitCastExpr - Emit code for an explicit or implicit cast. Implicit casts		// VisitCastExpr - Emit code for an explicit or implicit cast. Implicit casts
// have to handle a more broad range of conversions than explicit casts, as they		// have to handle a more broad range of conversions than explicit casts, as they
// handle things like function to ptr-to-function decay etc.		// handle things like function to ptr-to-function decay etc.
Value *ScalarExprEmitter::VisitCastExpr(CastExpr *CE) {		Value *ScalarExprEmitter::VisitCastExpr(CastExpr *CE) {
		vsk: I think maintaining a stack of visited cast exprs in the emitter would be cheaper/simpler than using ASTContext::getParents. You could push CE here and use a RAII helper to pop it. The 'isCastPartOfExplicitCast' check then simplifies to a quick stack traversal.
		lebedev.ri: Hmm, two things come to mind: 1. This pessimizes the (most popular) case when the sanitizer is disabled. 2. `ASTContext::getParents()` may return more than one parent. I'm not sure if that matters here? I'll take a look.
		vsk: As for (1), it's not necessary to push/pop the stack when this sanitizer is disabled. And for (2), IIUC the only interesting case is "explicit-cast <implicit-cast>+", and none of the implicit casts here have more than one parent.
		lebedev.ri:
		> I think maintaining a stack of visited cast exprs in the emitter be cheaper/simpler than using ASTContext::getParents.
		(Attachment: broken.patch, 5 KB.) So yeah, this could work. Except sadly it breaks in subtle cases like https://godbolt.org/g/5V2czU. I have added those tests beforehand. Is `ASTContext::getParents()` really horribly slow so we want to duplicate/maintain/track the current AST stack ourselves? If so, we will need to maintain the entire stack, not just `CastExpr`s...
		vsk: I think the scan in 'IsTopCastPartOfExplicitCast' can be fixed: while traversing backwards, you'd need to check that the previously-visited cast expr is the child of the current expr. That should address the false negative you pointed out in interference1. I don't yet see what the issue is with interference0. Could you explain why maintaining a stack of unfinished casts wouldn't work? I don't understand why you'd need the entire stack. My sense is that it's not required to match the "explicit-cast <implicit-cast>+" pattern, but I could easily be missing something here. As for why this might be worth looking into, I think scanning for an explicit cast is much easier to understand when working with a stack. + @klimek to comment on what to expect in terms of the overhead of ASTContext::getParents. Regardless of what approach we pick, it would help to see pre/post-patch compile times for a stage2 build of something like clang or llc.
		lebedev.ri:
		> I think the scan in 'IsTopCastPartOfExplicitCast' can be fixed: while traversing backwards, you'd need to check that the previously-visited cast expr is the child of the current expr. That should address the false negative you pointed out in interference1.
		Oh right. That seems to fix the issues.
		> Could you explain why maintaining a stack of unfinished casts wouldn't work?
		I didn't think it wouldn't work. I just missed the tidbit about checking children. Then it works.
Expr *E = CE->getSubExpr();		Expr *E = CE->getSubExpr();
QualType DestTy = CE->getType();		QualType DestTy = CE->getType();
CastKind Kind = CE->getCastKind();		CastKind Kind = CE->getCastKind();

// These cases are generally not written to ignore the result of		// These cases are generally not written to ignore the result of
// evaluating their sub-expressions, so we clear this now.		// evaluating their sub-expressions, so we clear this now.
bool Ignored = TestAndClearIgnoreResultAssign();		bool Ignored = TestAndClearIgnoreResultAssign();

▲ Show 20 Lines • Show All 220 Lines • ▼ Show 20 Lines	Value *ScalarExprEmitter::VisitCastExpr(CastExpr *CE) {
case CK_VectorSplat: {		case CK_VectorSplat: {
llvm::Type *DstTy = ConvertType(DestTy);		llvm::Type *DstTy = ConvertType(DestTy);
Value *Elt = Visit(const_cast<Expr*>(E));		Value *Elt = Visit(const_cast<Expr*>(E));
// Splat the element across to all elements		// Splat the element across to all elements
unsigned NumElements = DstTy->getVectorNumElements();		unsigned NumElements = DstTy->getVectorNumElements();
return Builder.CreateVectorSplat(NumElements, Elt, "splat");		return Builder.CreateVectorSplat(NumElements, Elt, "splat");
}		}

case CK_IntegralCast:		case CK_IntegralCast: {
		ScalarConversionOpts Opts;
		if (CGF.SanOpts.has(SanitizerKind::ImplicitIntegerTruncation)) {
		if (auto *ICE = dyn_cast<ImplicitCastExpr>(CE))
		Opts.EmitImplicitIntegerTruncationChecks = !ICE->isPartOfExplicitCast();
		}
		return EmitScalarConversion(Visit(E), E->getType(), DestTy,
		CE->getExprLoc(), Opts);
		}
case CK_IntegralToFloating:		case CK_IntegralToFloating:
case CK_FloatingToIntegral:		case CK_FloatingToIntegral:
case CK_FloatingCast:		case CK_FloatingCast:
return EmitScalarConversion(Visit(E), E->getType(), DestTy,		return EmitScalarConversion(Visit(E), E->getType(), DestTy,
CE->getExprLoc());		CE->getExprLoc());
case CK_BooleanToSignedIntegral:		case CK_BooleanToSignedIntegral: {
		ScalarConversionOpts Opts;
		Opts.TreatBooleanAsSigned = true;
return EmitScalarConversion(Visit(E), E->getType(), DestTy,		return EmitScalarConversion(Visit(E), E->getType(), DestTy,
CE->getExprLoc(),		CE->getExprLoc(), Opts);
/*TreatBooleanAsSigned=*/true);		}
case CK_IntegralToBoolean:		case CK_IntegralToBoolean:
return EmitIntToBoolConversion(Visit(E));		return EmitIntToBoolConversion(Visit(E));
case CK_PointerToBoolean:		case CK_PointerToBoolean:
return EmitPointerToBoolConversion(Visit(E), E->getType());		return EmitPointerToBoolConversion(Visit(E), E->getType());
case CK_FloatingToBoolean:		case CK_FloatingToBoolean:
return EmitFloatToBoolConversion(Visit(E));		return EmitFloatToBoolConversion(Visit(E));
case CK_MemberPointerToBoolean: {		case CK_MemberPointerToBoolean: {
llvm::Value *MemPtr = Visit(E);		llvm::Value *MemPtr = Visit(E);

lib/CodeGen/CodeGenFunction.h

#define LIST_SANITIZER_CHECKS \		#define LIST_SANITIZER_CHECKS \
SANITIZER_CHECK(AddOverflow, add_overflow, 0) \		SANITIZER_CHECK(AddOverflow, add_overflow, 0) \
SANITIZER_CHECK(BuiltinUnreachable, builtin_unreachable, 0) \		SANITIZER_CHECK(BuiltinUnreachable, builtin_unreachable, 0) \
SANITIZER_CHECK(CFICheckFail, cfi_check_fail, 0) \		SANITIZER_CHECK(CFICheckFail, cfi_check_fail, 0) \
SANITIZER_CHECK(DivremOverflow, divrem_overflow, 0) \		SANITIZER_CHECK(DivremOverflow, divrem_overflow, 0) \
SANITIZER_CHECK(DynamicTypeCacheMiss, dynamic_type_cache_miss, 0) \		SANITIZER_CHECK(DynamicTypeCacheMiss, dynamic_type_cache_miss, 0) \
SANITIZER_CHECK(FloatCastOverflow, float_cast_overflow, 0) \		SANITIZER_CHECK(FloatCastOverflow, float_cast_overflow, 0) \
SANITIZER_CHECK(FunctionTypeMismatch, function_type_mismatch, 0) \		SANITIZER_CHECK(FunctionTypeMismatch, function_type_mismatch, 0) \
		SANITIZER_CHECK(ImplicitConversion, implicit_conversion, 0) \
SANITIZER_CHECK(InvalidBuiltin, invalid_builtin, 0) \		SANITIZER_CHECK(InvalidBuiltin, invalid_builtin, 0) \
SANITIZER_CHECK(LoadInvalidValue, load_invalid_value, 0) \		SANITIZER_CHECK(LoadInvalidValue, load_invalid_value, 0) \
SANITIZER_CHECK(MissingReturn, missing_return, 0) \		SANITIZER_CHECK(MissingReturn, missing_return, 0) \
SANITIZER_CHECK(MulOverflow, mul_overflow, 0) \		SANITIZER_CHECK(MulOverflow, mul_overflow, 0) \
SANITIZER_CHECK(NegateOverflow, negate_overflow, 0) \		SANITIZER_CHECK(NegateOverflow, negate_overflow, 0) \
SANITIZER_CHECK(NullabilityArg, nullability_arg, 0) \		SANITIZER_CHECK(NullabilityArg, nullability_arg, 0) \
SANITIZER_CHECK(NullabilityReturn, nullability_return, 1) \		SANITIZER_CHECK(NullabilityReturn, nullability_return, 1) \
SANITIZER_CHECK(NonnullArg, nonnull_arg, 0) \		SANITIZER_CHECK(NonnullArg, nonnull_arg, 0) \
▲ Show 20 Lines • Show All 328 Lines • ▼ Show 20 Lines	class SanitizerScope {
CodeGenFunction *CGF;		CodeGenFunction *CGF;
public:		public:
SanitizerScope(CodeGenFunction *CGF);		SanitizerScope(CodeGenFunction *CGF);
~SanitizerScope();		~SanitizerScope();
};		};

/// In C++, whether we are code generating a thunk. This controls whether we		/// In C++, whether we are code generating a thunk. This controls whether we
/// should emit cleanups.		/// should emit cleanups.
bool CurFuncIsThunk = false;		bool CurFuncIsThunk = false;
		vsk: Why not 0 instead of 8, given that in the common case, this stack is unused?
		lebedev.ri: No longer relevant.
		vsk: I'm referring to CastExprStack within ScalarExprEmitter, which still allocates space for 8 pointers inline.
		lebedev.ri: Ah, you mean in the general case when the sanitizer is disabled?
		vsk: Yes. It's a relatively minor concern, but clang's stack can get pretty deep inside of CodeGenFunction. At one point we needed to outline code by hand to unbreak the ASan build. Later I think we just increased the stack size rlimit. I don't see a countervailing performance benefit of allocating more space inline, at least not here.
		lebedev.ri: No, i agree and totally understand. I just didn't think about that sanitizer-less context.

/// In ARC, whether we should autorelease the return value.		/// In ARC, whether we should autorelease the return value.
bool AutoreleaseResult = false;		bool AutoreleaseResult = false;

/// Whether we processed a Microsoft-style asm block during CodeGen. These can		/// Whether we processed a Microsoft-style asm block during CodeGen. These can
		vsk: I'm not sure the cost of maintaining an extra vector is worth the benefit of the added assertion. Wouldn't it be cheaper to just store the number of pushed casts? You'd only need one constructor which accepts an `ArrayRef<const CastExpr *>`.
		lebedev.ri: No longer relevant.
/// potentially set the return value.		/// potentially set the return value.
bool SawAsmBlock = false;		bool SawAsmBlock = false;

const FunctionDecl *CurSEHParent = nullptr;		const FunctionDecl *CurSEHParent = nullptr;

/// True if the current function is an outlined SEH helper. This can be a		/// True if the current function is an outlined SEH helper. This can be a
/// finally block or filter expression.		/// finally block or filter expression.
bool IsOutlinedSEHHelper = false;		bool IsOutlinedSEHHelper = false;

lib/Driver/SanitizerArgs.cpp

	#include <memory>			#include <memory>

	using namespace clang;			using namespace clang;
	using namespace clang::SanitizerKind;			using namespace clang::SanitizerKind;
	using namespace clang::driver;			using namespace clang::driver;
	using namespace llvm::opt;			using namespace llvm::opt;

	enum : SanitizerMask {			enum : SanitizerMask {
	NeedsUbsanRt = Undefined | Integer | Nullability | CFI,			NeedsUbsanRt = Undefined | Integer | ImplicitConversion | Nullability | CFI,
	NeedsUbsanCxxRt = Vptr | CFI,			NeedsUbsanCxxRt = Vptr | CFI,
	NotAllowedWithTrap = Vptr,			NotAllowedWithTrap = Vptr,
	NotAllowedWithMinimalRuntime = Vptr,			NotAllowedWithMinimalRuntime = Vptr,
	RequiresPIE = DataFlow | HWAddress | Scudo,			RequiresPIE = DataFlow | HWAddress | Scudo,
	NeedsUnwindTables = Address | HWAddress | Thread | Memory | DataFlow,			NeedsUnwindTables = Address | HWAddress | Thread | Memory | DataFlow,
	SupportsCoverage = Address | HWAddress | KernelAddress | KernelHWAddress |			SupportsCoverage = Address | HWAddress | KernelAddress | KernelHWAddress |
	Memory | Leak | Undefined | Integer | Nullability |			Memory | Leak | Undefined | Integer | ImplicitConversion |
	DataFlow | Fuzzer | FuzzerNoLink,			Nullability | DataFlow | Fuzzer | FuzzerNoLink,
	RecoverableByDefault = Undefined | Integer | Nullability,			RecoverableByDefault = Undefined | Integer | ImplicitConversion | Nullability,
	Unrecoverable = Unreachable | Return,			Unrecoverable = Unreachable | Return,
	AlwaysRecoverable = KernelAddress | KernelHWAddress,			AlwaysRecoverable = KernelAddress | KernelHWAddress,
	LegacyFsanitizeRecoverMask = Undefined | Integer,			LegacyFsanitizeRecoverMask = Undefined | Integer,
	NeedsLTO = CFI,			NeedsLTO = CFI,
	TrappingSupported = (Undefined & ~Vptr) | UnsignedIntegerOverflow |			TrappingSupported = (Undefined & ~Vptr) | UnsignedIntegerOverflow |
	Nullability | LocalBounds | CFI,			ImplicitConversion | Nullability | LocalBounds | CFI,
	TrappingDefault = CFI,			TrappingDefault = CFI,
	CFIClasses =			CFIClasses =
	CFIVCall | CFINVCall | CFIMFCall | CFIDerivedCast | CFIUnrelatedCast,			CFIVCall | CFINVCall | CFIMFCall | CFIDerivedCast | CFIUnrelatedCast,
	CompatibleWithMinimalRuntime = TrappingSupported | Scudo,			CompatibleWithMinimalRuntime = TrappingSupported | Scudo,
	};			};
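
The enum above composes sanitizer kinds as bitmasks, and the patch simply ORs ImplicitConversion into the relevant sets. A minimal sketch of how such masks behave follows. The bit values, the `Mask` alias, the `Alignment` stand-in, and the `allows` helper are hypothetical and only for illustration; the real values come from Sanitizers.def and are not reproduced here. Treating ImplicitConversion as a group containing the truncation check is an assumption based on the `no_sanitize("implicit-conversion")` test in this revision.

```cpp
#include <cstdint>

// Hypothetical bit assignments, NOT the real SanitizerKind values.
using Mask = std::uint64_t;

constexpr Mask Vptr = 1ULL << 0;
constexpr Mask Alignment = 1ULL << 1;                  // stand-in for the other "undefined" checks
constexpr Mask Undefined = Vptr | Alignment;           // a group is the OR of its members
constexpr Mask ImplicitIntegerTruncation = 1ULL << 2;
constexpr Mask ImplicitConversion = ImplicitIntegerTruncation; // assumed group with one member
constexpr Mask Nullability = 1ULL << 3;

// Reduced versions of two sets from the enum above.
constexpr Mask RecoverableByDefault = Undefined | ImplicitConversion | Nullability;
constexpr Mask TrappingSupported = (Undefined & ~Vptr) | ImplicitConversion | Nullability;

// True when every bit of `kind` is present in `set`.
constexpr bool allows(Mask set, Mask kind) { return (set & kind) == kind; }
```

With these assumed values, `(Undefined & ~Vptr)` removes only the Vptr bit from the group, so trapping stays available for the new check while Vptr remains excluded.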

	enum CoverageFeature {			enum CoverageFeature {
	CoverageFunc = 1 << 0,			CoverageFunc = 1 << 0,

lib/Driver/ToolChain.cpp


	SanitizerMask ToolChain::getSupportedSanitizers() const {			SanitizerMask ToolChain::getSupportedSanitizers() const {
	// Return sanitizers which don't require runtime support and are not			// Return sanitizers which don't require runtime support and are not
	// platform dependent.			// platform dependent.

	using namespace SanitizerKind;			using namespace SanitizerKind;

	SanitizerMask Res = (Undefined & ~Vptr & ~Function) | (CFI & ~CFIICall) |			SanitizerMask Res = (Undefined & ~Vptr & ~Function) | (CFI & ~CFIICall) |
	CFICastStrict | UnsignedIntegerOverflow | Nullability |			CFICastStrict | UnsignedIntegerOverflow |
	LocalBounds;			ImplicitConversion | Nullability | LocalBounds;
	if (getTriple().getArch() == llvm::Triple::x86 ||			if (getTriple().getArch() == llvm::Triple::x86 ||
	getTriple().getArch() == llvm::Triple::x86_64 ||			getTriple().getArch() == llvm::Triple::x86_64 ||
	getTriple().getArch() == llvm::Triple::arm ||			getTriple().getArch() == llvm::Triple::arm ||
	getTriple().getArch() == llvm::Triple::aarch64 ||			getTriple().getArch() == llvm::Triple::aarch64 ||
	getTriple().getArch() == llvm::Triple::wasm32 ||			getTriple().getArch() == llvm::Triple::wasm32 ||
	getTriple().getArch() == llvm::Triple::wasm64)			getTriple().getArch() == llvm::Triple::wasm64)
	Res |= CFIICall;			Res |= CFIICall;
	if (getTriple().getArch() == llvm::Triple::x86_64 ||			if (getTriple().getArch() == llvm::Triple::x86_64 ||

test/CodeGen/catch-implicit-integer-truncations.c

				// RUN: %clang_cc1 -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefix=CHECK
				// RUN: %clang_cc1 -fsanitize=implicit-integer-truncation -fno-sanitize-recover=implicit-integer-truncation -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefixes=CHECK,CHECK-SANITIZE,CHECK-SANITIZE-ANYRECOVER,CHECK-SANITIZE-NORECOVER
				// RUN: %clang_cc1 -fsanitize=implicit-integer-truncation -fsanitize-recover=implicit-integer-truncation -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefixes=CHECK,CHECK-SANITIZE,CHECK-SANITIZE-ANYRECOVER,CHECK-SANITIZE-RECOVER
				// RUN: %clang_cc1 -fsanitize=implicit-integer-truncation -fsanitize-trap=implicit-integer-truncation -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefixes=CHECK,CHECK-SANITIZE,CHECK-SANITIZE-TRAP

				// CHECK-SANITIZE-ANYRECOVER: @[[UNSIGNED_INT:.*]] = {{.*}} c"'unsigned int'\00" }
				// CHECK-SANITIZE-ANYRECOVER: @[[UNSIGNED_CHAR:.*]] = {{.*}} c"'unsigned char'\00" }

				// CHECK-SANITIZE-ANYRECOVER: @[[LINE_100:.*]] = {{.*}}, i32 100, i32 10 }, {{.*}}* @[[UNSIGNED_INT]], {{.*}}* @[[UNSIGNED_CHAR]], i8 0 }
				// CHECK-SANITIZE-ANYRECOVER: @[[SIGNED_INT:.*]] = {{.*}} c"'int'\00" }
				// CHECK-SANITIZE-ANYRECOVER: @[[LINE_200:.*]] = {{.*}}, i32 200, i32 10 }, {{.*}}* @[[SIGNED_INT]], {{.*}}* @[[UNSIGNED_CHAR]], i8 0 }
				// CHECK-SANITIZE-ANYRECOVER: @[[SIGNED_CHAR:.*]] = {{.*}} c"'signed char'\00" }
				// CHECK-SANITIZE-ANYRECOVER: @[[LINE_300:.*]] = {{.*}}, i32 300, i32 10 }, {{.*}}* @[[UNSIGNED_INT]], {{.*}}* @[[SIGNED_CHAR]], i8 0 }
				// CHECK-SANITIZE-ANYRECOVER: @[[LINE_400:.*]] = {{.*}}, i32 400, i32 10 }, {{.*}}* @[[SIGNED_INT]], {{.*}}* @[[SIGNED_CHAR]], i8 0 }

				// CHECK-SANITIZE-ANYRECOVER: @[[UINT32:.*]] = {{.*}} c"'uint32_t' (aka 'unsigned int')\00" }
				// CHECK-SANITIZE-ANYRECOVER: @[[UINT8:.*]] = {{.*}} c"'uint8_t' (aka 'unsigned char')\00" }
				// CHECK-SANITIZE-ANYRECOVER: @[[LINE_500:.*]] = {{.*}}, i32 500, i32 10 }, {{.*}}* @[[UINT32]], {{.*}}* @[[UINT8]], i8 0 }

				// ========================================================================== //
				// The expected true-positives. These are implicit conversions, and they truncate.
				// ========================================================================== //

				// CHECK-LABEL: @unsigned_int_to_unsigned_char
				unsigned char unsigned_int_to_unsigned_char(unsigned int src) {
				// CHECK: %[[DST:.*]] = trunc i32 %[[SRC:.*]] to i8
				// CHECK-SANITIZE-NEXT: %[[ANYEXT:.*]] = zext i8 %[[DST]] to i32, !nosanitize
				// CHECK-SANITIZE-NEXT: %[[TRUNCHECK:.*]] = icmp eq i32 %[[ANYEXT]], %[[SRC]], !nosanitize
				// CHECK-SANITIZE-NEXT: br i1 %[[TRUNCHECK]], label %[[CONT:.*]], label %[[HANDLER_IMPLICIT_CONVERSION:[^,]+]],{{.*}} !nosanitize
				// CHECK-SANITIZE: [[HANDLER_IMPLICIT_CONVERSION]]:
				vsk: There's no need to check the profile metadata here.
				lebedev.ri (Author): I was checking it because otherwise `HANDLER_IMPLICIT_CAST` would have over-eagerly consumed `, !prof !3` too. But there is actually a way around that.
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTSRC:.*]] = zext i32 %[[SRC]] to i64, !nosanitize
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTDST:.*]] = zext i8 %[[DST]] to i64, !nosanitize
				// CHECK-SANITIZE-NORECOVER-NEXT: call void @__ubsan_handle_implicit_conversion_abort(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_100]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-RECOVER-NEXT: call void @__ubsan_handle_implicit_conversion(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_100]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: call void @llvm.trap(){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: unreachable, !nosanitize
				// CHECK-SANITIZE: [[CONT]]:
				// CHECK: ret i8 %[[DST]]
				#line 100
				return src;
				}

				// CHECK-LABEL: @signed_int_to_unsigned_char
				unsigned char signed_int_to_unsigned_char(signed int src) {
				// CHECK: %[[DST:.*]] = trunc i32 %[[SRC:.*]] to i8
				// CHECK-SANITIZE-NEXT: %[[ANYEXT:.*]] = zext i8 %[[DST]] to i32, !nosanitize
				// CHECK-SANITIZE-NEXT: %[[TRUNCHECK:.*]] = icmp eq i32 %[[ANYEXT]], %[[SRC]], !nosanitize
				// CHECK-SANITIZE-NEXT: br i1 %[[TRUNCHECK]], label %[[CONT:.*]], label %[[HANDLER_IMPLICIT_CONVERSION:[^,]+]],{{.*}} !nosanitize
				// CHECK-SANITIZE: [[HANDLER_IMPLICIT_CONVERSION]]:
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTSRC:.*]] = zext i32 %[[SRC]] to i64, !nosanitize
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTDST:.*]] = zext i8 %[[DST]] to i64, !nosanitize
				// CHECK-SANITIZE-NORECOVER-NEXT: call void @__ubsan_handle_implicit_conversion_abort(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_200]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-RECOVER-NEXT: call void @__ubsan_handle_implicit_conversion(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_200]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: call void @llvm.trap(){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: unreachable, !nosanitize
				// CHECK-SANITIZE: [[CONT]]:
				// CHECK: ret i8 %[[DST]]
				#line 200
				return src;
				}

				// CHECK-LABEL: @unsigned_int_to_signed_char
				signed char unsigned_int_to_signed_char(unsigned int src) {
				// CHECK: %[[DST:.*]] = trunc i32 %[[SRC:.*]] to i8
				// CHECK-SANITIZE-NEXT: %[[ANYEXT:.*]] = sext i8 %[[DST]] to i32, !nosanitize
				// CHECK-SANITIZE-NEXT: %[[TRUNCHECK:.*]] = icmp eq i32 %[[ANYEXT]], %[[SRC]], !nosanitize
				// CHECK-SANITIZE-NEXT: br i1 %[[TRUNCHECK]], label %[[CONT:.*]], label %[[HANDLER_IMPLICIT_CONVERSION:[^,]+]],{{.*}} !nosanitize
				// CHECK-SANITIZE: [[HANDLER_IMPLICIT_CONVERSION]]:
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTSRC:.*]] = zext i32 %[[SRC]] to i64, !nosanitize
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTDST:.*]] = zext i8 %[[DST]] to i64, !nosanitize
				// CHECK-SANITIZE-NORECOVER-NEXT: call void @__ubsan_handle_implicit_conversion_abort(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_300]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-RECOVER-NEXT: call void @__ubsan_handle_implicit_conversion(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_300]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: call void @llvm.trap(){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: unreachable, !nosanitize
				// CHECK-SANITIZE: [[CONT]]:
				// CHECK: ret i8 %[[DST]]
				#line 300
				return src;
				}

				// CHECK-LABEL: @signed_int_to_signed_char
				signed char signed_int_to_signed_char(signed int src) {
				// CHECK: %[[DST:.*]] = trunc i32 %[[SRC:.*]] to i8
				// CHECK-SANITIZE-NEXT: %[[ANYEXT:.*]] = sext i8 %[[DST]] to i32, !nosanitize
				// CHECK-SANITIZE-NEXT: %[[TRUNCHECK:.*]] = icmp eq i32 %[[ANYEXT]], %[[SRC]], !nosanitize
				// CHECK-SANITIZE-NEXT: br i1 %[[TRUNCHECK]], label %[[CONT:.*]], label %[[HANDLER_IMPLICIT_CONVERSION:[^,]+]],{{.*}} !nosanitize
				// CHECK-SANITIZE: [[HANDLER_IMPLICIT_CONVERSION]]:
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTSRC:.*]] = zext i32 %[[SRC]] to i64, !nosanitize
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTDST:.*]] = zext i8 %[[DST]] to i64, !nosanitize
				// CHECK-SANITIZE-NORECOVER-NEXT: call void @__ubsan_handle_implicit_conversion_abort(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_400]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-RECOVER-NEXT: call void @__ubsan_handle_implicit_conversion(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_400]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: call void @llvm.trap(){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: unreachable, !nosanitize
				// CHECK-SANITIZE: [[CONT]]:
				// CHECK: ret i8 %[[DST]]
				#line 400
				return src;
				}
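
The four true-positive tests above all check the same IR shape: truncate, extend the result back to the source width (zext for an unsigned destination, sext for a signed one), and compare with the original value. A minimal sketch of that comparison in plain C++ follows. The function names are hypothetical, and this models only the check as it appears in the CHECK lines of this revision, not the actual CodeGen implementation.

```cpp
#include <cstdint>

// Models: trunc i32 -> i8, zext i8 -> i32, icmp eq with the source.
// Returns true when the check would fail, i.e. the handler would be called.
bool truncation_reported_uchar(std::uint32_t src) {
  std::uint8_t dst = static_cast<std::uint8_t>(src);        // trunc
  std::uint32_t widened = static_cast<std::uint32_t>(dst);  // zext
  return widened != src;
}

// Models: trunc i32 -> i8, sext i8 -> i32, icmp eq with the source.
// Sign extension is done by hand so the sketch avoids any
// implementation-defined narrowing to a signed type.
bool truncation_reported_schar(std::uint32_t src) {
  std::uint32_t low = src & 0xFFu;                          // trunc
  std::uint32_t widened =
      (low & 0x80u) ? (low | 0xFFFFFF00u) : low;            // sext
  return widened != src;
}
```

Under this model, 256 converted to `unsigned char` is reported, while 255 is not. For a `signed char` destination, 128 is reported, but the 32-bit pattern 0xFFFFFFFF is not, because sign-extending the truncated byte reproduces the same bits.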

				// ========================================================================== //
				// Check canonical type stuff
				// ========================================================================== //

				typedef unsigned int uint32_t;
				typedef unsigned char uint8_t;

				// CHECK-LABEL: @uint32_to_uint8
				uint8_t uint32_to_uint8(uint32_t src) {
				// CHECK: %[[DST:.*]] = trunc i32 %[[SRC:.*]] to i8
				// CHECK-SANITIZE-NEXT: %[[ANYEXT:.*]] = zext i8 %[[DST]] to i32, !nosanitize
				// CHECK-SANITIZE-NEXT: %[[TRUNCHECK:.*]] = icmp eq i32 %[[ANYEXT]], %[[SRC]], !nosanitize
				// CHECK-SANITIZE-NEXT: br i1 %[[TRUNCHECK]], label %[[CONT:.*]], label %[[HANDLER_IMPLICIT_CONVERSION:[^,]+]],{{.*}} !nosanitize
				// CHECK-SANITIZE: [[HANDLER_IMPLICIT_CONVERSION]]:
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTSRC:.*]] = zext i32 %[[SRC]] to i64, !nosanitize
				// CHECK-SANITIZE-ANYRECOVER-NEXT: %[[EXTDST:.*]] = zext i8 %[[DST]] to i64, !nosanitize
				// CHECK-SANITIZE-NORECOVER-NEXT: call void @__ubsan_handle_implicit_conversion_abort(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_500]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-RECOVER-NEXT: call void @__ubsan_handle_implicit_conversion(i8* bitcast ({ {{{.*}}}, {{{.*}}}*, {{{.*}}}*, i8 }* @[[LINE_500]] to i8*), i64 %[[EXTSRC]], i64 %[[EXTDST]]){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: call void @llvm.trap(){{.*}}, !nosanitize
				// CHECK-SANITIZE-TRAP-NEXT: unreachable, !nosanitize
				// CHECK-SANITIZE: [[CONT]]:
				// CHECK: ret i8 %[[DST]]
				#line 500
				return src;
				}

				// ========================================================================== //
				// Check that explicit conversion does not interfere with implicit conversion
				// ========================================================================== //
				// These contain one implicit truncating conversion, and one explicit truncating conversion.
				// We want to make sure that we still diagnose the implicit conversion.

				// Implicit truncation after explicit truncation.
				// CHECK-LABEL: @explicit_conversion_interference0
				unsigned char explicit_conversion_interference0(unsigned int c) {
				// CHECK-SANITIZE: %[[ANYEXT:.*]] = zext i8 %[[DST:.*]] to i16, !nosanitize
				// CHECK-SANITIZE: call
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned short)c;
				}

				// Implicit truncation before explicit truncation.
				// CHECK-LABEL: @explicit_conversion_interference1
				unsigned char explicit_conversion_interference1(unsigned int c) {
				// CHECK-SANITIZE: %[[ANYEXT:.*]] = zext i16 %[[DST:.*]] to i32, !nosanitize
				// CHECK-SANITIZE: call
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				unsigned short b;
				return (unsigned char)(b = c);
				}

				// ========================================================================== //
				// The expected true-negatives.
				// ========================================================================== //

				// Sanitization is explicitly disabled.
				// ========================================================================== //

				// CHECK-LABEL: @blacklist_0
				vsk: nit, aren't these true-negatives because we expect to see no errors?
				lebedev.ri (Author): Right.
				__attribute__((no_sanitize("undefined"))) unsigned char blacklist_0(unsigned int src) {
				// We are not in "undefined" group, so that doesn't work.
				// CHECK-SANITIZE: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @blacklist_1
				__attribute__((no_sanitize("implicit-conversion"))) unsigned char blacklist_1(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @blacklist_2
				__attribute__((no_sanitize("implicit-integer-truncation"))) unsigned char blacklist_2(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// Explicit truncating conversions.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_unsigned_int_to_unsigned_char
				unsigned char explicit_unsigned_int_to_unsigned_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned char)src;
				}

				// CHECK-LABEL: @explicit_signed_int_to_unsigned_char
				unsigned char explicit_signed_int_to_unsigned_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned char)src;
				}

				// CHECK-LABEL: @explicit_unsigned_int_to_signed_char
				signed char explicit_unsigned_int_to_signed_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed char)src;
				}

				// CHECK-LABEL: @explicit_signed_int_to_signed_char
				signed char explicit_signed_int_to_signed_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed char)src;
				}

				// Explicit NOP conversions.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_unsigned_int_to_unsigned_int
				unsigned int explicit_unsigned_int_to_unsigned_int(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned int)src;
				}

				// CHECK-LABEL: @explicit_signed_int_to_signed_int
				signed int explicit_signed_int_to_signed_int(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed int)src;
				}

				// CHECK-LABEL: @explicit_unsigned_char_to_signed_char
				unsigned char explicit_unsigned_char_to_signed_char(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned char)src;
				}

				// CHECK-LABEL: @explicit_signed_char_to_signed_char
				signed char explicit_signed_char_to_signed_char(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed char)src;
				}

				// upcasts.
				// ========================================================================== //

				// CHECK-LABEL: @unsigned_char_to_unsigned_int
				unsigned int unsigned_char_to_unsigned_int(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @signed_char_to_unsigned_int
				unsigned int signed_char_to_unsigned_int(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @unsigned_char_to_signed_int
				signed int unsigned_char_to_signed_int(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @signed_char_to_signed_int
				signed int signed_char_to_signed_int(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// Explicit upcasts.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_unsigned_char_to_unsigned_int
				unsigned int explicit_unsigned_char_to_unsigned_int(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned int)src;
				}

				// CHECK-LABEL: @explicit_signed_char_to_unsigned_int
				unsigned int explicit_signed_char_to_unsigned_int(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned int)src;
				}

				// CHECK-LABEL: @explicit_unsigned_char_to_signed_int
				signed int explicit_unsigned_char_to_signed_int(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed int)src;
				}

				// CHECK-LABEL: @explicit_signed_char_to_signed_int
				signed int explicit_signed_char_to_signed_int(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed int)src;
				}

				// Conversions to boolean type are not counted as truncation.
				// ========================================================================== //

				// CHECK-LABEL: @unsigned_int_to_bool
				_Bool unsigned_int_to_bool(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @signed_int_to_bool
				_Bool signed_int_to_bool(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @explicit_unsigned_int_to_bool
				_Bool explicit_unsigned_int_to_bool(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (_Bool)src;
				}

				// CHECK-LABEL: @explicit_signed_int_to_bool
				_Bool explicit_signed_int_to_bool(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (_Bool)src;
				}

				// Explicit truncating conversions from pointer to a much-smaller integer.
				// Can not have an implicit conversion from pointer to an integer.
				// Can not have an implicit conversion between two enums.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_voidptr_to_unsigned_char
				unsigned char explicit_voidptr_to_unsigned_char(void *src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned char)src;
				}

				// CHECK-LABEL: @explicit_voidptr_to_signed_char
				signed char explicit_voidptr_to_signed_char(void *src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed char)src;
				}

				// Implicit truncating conversions from floating-point may result in precision loss.
				// ========================================================================== //

				// CHECK-LABEL: @float_to_unsigned_int
				unsigned int float_to_unsigned_int(float src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @float_to_signed_int
				signed int float_to_signed_int(float src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @double_to_unsigned_int
				unsigned int double_to_unsigned_int(double src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// CHECK-LABEL: @double_to_signed_int
				signed int double_to_signed_int(double src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

				// Implicit truncating conversions between fp may result in precision loss.
				// ========================================================================== //

				// CHECK-LABEL: @double_to_float
				float double_to_float(double src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return src;
				}

test/CodeGenCXX/catch-implicit-integer-truncations.cpp

				// RUN: %clang_cc1 -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefix=CHECK
				// RUN: %clang_cc1 -fsanitize=implicit-integer-truncation -fno-sanitize-recover=implicit-integer-truncation -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefixes=CHECK,CHECK-SANITIZE,CHECK-SANITIZE-ANYRECOVER,CHECK-SANITIZE-NORECOVER
				// RUN: %clang_cc1 -fsanitize=implicit-integer-truncation -fsanitize-recover=implicit-integer-truncation -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefixes=CHECK,CHECK-SANITIZE,CHECK-SANITIZE-ANYRECOVER,CHECK-SANITIZE-RECOVER
				// RUN: %clang_cc1 -fsanitize=implicit-integer-truncation -fsanitize-trap=implicit-integer-truncation -emit-llvm %s -o - -triple x86_64-linux-gnu | FileCheck %s --check-prefixes=CHECK,CHECK-SANITIZE,CHECK-SANITIZE-TRAP

				extern "C" { // Disable name mangling.

				// ========================================================================== //
				// Check that explicit cast does not interfere with implicit conversion
				// ========================================================================== //
				// These contain one implicit truncating conversion, and one explicit truncating cast.
				// We want to make sure that we still diagnose the implicit conversion.

				// Implicit truncation after explicit truncation.
				// CHECK-LABEL: @explicit_cast_interference0
				unsigned char explicit_cast_interference0(unsigned int c) {
				// CHECK-SANITIZE: %[[ANYEXT:.*]] = zext i8 %[[DST:.*]] to i16, !nosanitize
				// CHECK-SANITIZE: call
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned short)c;
				}

				// Implicit truncation before explicit truncation.
				// CHECK-LABEL: @explicit_cast_interference1
				unsigned char explicit_cast_interference1(unsigned int c) {
				// CHECK-SANITIZE: %[[ANYEXT:.*]] = zext i16 %[[DST:.*]] to i32, !nosanitize
				// CHECK-SANITIZE: call
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				unsigned short b;
				return (unsigned char)(b = c);
				}

				lebedev.ri (Author): @rsmith these tests should be equivalent to what you have brought up, so that situation was already tested.
				// ========================================================================== //
				// The expected true-negatives.
				// ========================================================================== //

				// Explicit truncating casts.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_unsigned_int_to_unsigned_char
				unsigned char explicit_unsigned_int_to_unsigned_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned char)src;
				}

				// CHECK-LABEL: @explicit_signed_int_to_unsigned_char
				unsigned char explicit_signed_int_to_unsigned_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned char)src;
				}

				// CHECK-LABEL: @explicit_unsigned_int_to_signed_char
				signed char explicit_unsigned_int_to_signed_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed char)src;
				}

				// CHECK-LABEL: @explicit_signed_int_to_signed_char
				signed char explicit_signed_int_to_signed_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed char)src;
				}

				// Explicit NOP casts.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_unsigned_int_to_unsigned_int
				unsigned int explicit_unsigned_int_to_unsigned_int(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned int)src;
				}

				// CHECK-LABEL: @explicit_signed_int_to_signed_int
				signed int explicit_signed_int_to_signed_int(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed int)src;
				}

				// CHECK-LABEL: @explicit_unsigned_char_to_signed_char
				unsigned char explicit_unsigned_char_to_signed_char(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (unsigned char)src;
				}

				// CHECK-LABEL: @explicit_signed_char_to_signed_char
				signed char explicit_signed_char_to_signed_char(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return (signed char)src;
				}

				// Explicit functional truncating casts.
				// ========================================================================== //

				using UnsignedChar = unsigned char;
				using SignedChar = signed char;
				using UnsignedInt = unsigned int;
				using SignedInt = signed int;

				// CHECK-LABEL: @explicit_functional_unsigned_int_to_unsigned_char
				unsigned char explicit_functional_unsigned_int_to_unsigned_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return UnsignedChar(src);
				}

				// CHECK-LABEL: @explicit_functional_signed_int_to_unsigned_char
				unsigned char explicit_functional_signed_int_to_unsigned_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return UnsignedChar(src);
				}

				// CHECK-LABEL: @explicit_functional_unsigned_int_to_signed_char
				signed char explicit_functional_unsigned_int_to_signed_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return SignedChar(src);
				}

				// CHECK-LABEL: @explicit_functional_signed_int_to_signed_char
				signed char explicit_functional_signed_int_to_signed_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return SignedChar(src);
				}

				// Explicit functional NOP casts.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_functional_unsigned_int_to_unsigned_int
				unsigned int explicit_functional_unsigned_int_to_unsigned_int(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return UnsignedInt(src);
				}

				// CHECK-LABEL: @explicit_functional_signed_int_to_signed_int
				signed int explicit_functional_signed_int_to_signed_int(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return SignedInt(src);
				}

				// CHECK-LABEL: @explicit_functional_unsigned_char_to_signed_char
				unsigned char explicit_functional_unsigned_char_to_signed_char(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return UnsignedChar(src);
				}

				// CHECK-LABEL: @explicit_functional_signed_char_to_signed_char
				signed char explicit_functional_signed_char_to_signed_char(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return SignedChar(src);
				}

				// Explicit C++-style truncating casts.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_cppstyleunsigned_int_to_unsigned_char
				unsigned char explicit_cppstyleunsigned_int_to_unsigned_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<unsigned char>(src);
				}

				// CHECK-LABEL: @explicit_cppstylesigned_int_to_unsigned_char
				unsigned char explicit_cppstylesigned_int_to_unsigned_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<unsigned char>(src);
				}

				// CHECK-LABEL: @explicit_cppstyleunsigned_int_to_signed_char
				signed char explicit_cppstyleunsigned_int_to_signed_char(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<signed char>(src);
				}

				// CHECK-LABEL: @explicit_cppstylesigned_int_to_signed_char
				signed char explicit_cppstylesigned_int_to_signed_char(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<signed char>(src);
				}

				// Explicit C++-style NOP casts.
				// ========================================================================== //

				// CHECK-LABEL: @explicit_cppstyleunsigned_int_to_unsigned_int
				unsigned int explicit_cppstyleunsigned_int_to_unsigned_int(unsigned int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<unsigned int>(src);
				}

				// CHECK-LABEL: @explicit_cppstylesigned_int_to_signed_int
				signed int explicit_cppstylesigned_int_to_signed_int(signed int src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<signed int>(src);
				}

				// CHECK-LABEL: @explicit_cppstyleunsigned_char_to_signed_char
				unsigned char explicit_cppstyleunsigned_char_to_signed_char(unsigned char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<unsigned char>(src);
				}

				// CHECK-LABEL: @explicit_cppstylesigned_char_to_signed_char
				signed char explicit_cppstylesigned_char_to_signed_char(signed char src) {
				// CHECK-SANITIZE-NOT: call
				// CHECK: }
				return static_cast<signed char>(src);
				}

				} // extern "C"

				// ---------------------------------------------------------------------------//
				// A problematic true-negative involving simple C++ code.
				// The problem is that the NoOp ExplicitCast is directly within MaterializeTemporaryExpr(),
				// so special care is needed.
				// See https://reviews.llvm.org/D48958#1161345
				template <typename a>
				a b(a c, const a &d) {
				if (d)
				;
				return c;
				}

				extern "C" { // Disable name mangling.

				// CHECK-LABEL: @false_positive_with_MaterializeTemporaryExpr
				int false_positive_with_MaterializeTemporaryExpr() {
				// CHECK-SANITIZE-NOT: call{{.*}}ubsan
				// CHECK: }
				int e = b<unsigned>(4, static_cast<unsigned>(4294967296));
				return e;
				}

				// ---------------------------------------------------------------------------//

				} // extern "C"
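
For readers skimming these tests, the property they rely on can be sketched outside of clang: a truncation is lossless exactly when narrowing the value and widening it back reproduces the original. The helper below, `round_trips`, is hypothetical and not part of this patch; treating it as the condition the sanitizer checks at run time for implicit conversions is an assumption made for illustration. Explicit casts such as the `static_cast`s above are exempt by design, which is why those functions expect no sanitizer call even when the value would fail this predicate.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (an assumption for illustration, not code from the patch):
// returns true when narrowing `src` to Dst and widening it back to Src
// reproduces the original value, i.e. the truncation lost no information.
template <typename Dst, typename Src>
constexpr bool round_trips(Src src) {
  return static_cast<Src>(static_cast<Dst>(src)) == src;
}

// A few values mirroring the kinds of conversions the tests above cover.
inline void run_examples() {
  assert(round_trips<unsigned char>(255u));   // fits in 8 bits
  assert(!round_trips<unsigned char>(256u));  // high bits are dropped
  assert(round_trips<signed char>(-1));       // sign-extends back to -1
  assert(!round_trips<signed char>(128));     // not representable in signed char
  // The value used in false_positive_with_MaterializeTemporaryExpr():
  assert(!round_trips<std::uint32_t>(4294967296ULL));
}
```

In the MaterializeTemporaryExpr test, the last value is narrowed by an explicit `static_cast<unsigned>`, so the expectation there is that no check is emitted even though the predicate would fail.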

test/Driver/fsanitize.c

	// CHECK-UNDEFINED-WIN-SAME: "-fsanitize={{((signed-integer-overflow|integer-divide-by-zero|float-divide-by-zero|shift-base|shift-exponent|unreachable|return|vla-bound|alignment|null|pointer-overflow|float-cast-overflow|array-bounds|enum|bool|builtin|returns-nonnull-attribute|nonnull-attribute),?){18}"}}

	// RUN: %clang -target i386-pc-win32 -fsanitize-coverage=bb %s -### 2>&1 | FileCheck %s --check-prefix=CHECK-COVERAGE-WIN32
	// CHECK-COVERAGE-WIN32: "--dependent-lib={{[^"]*}}ubsan_standalone-i386.lib"
	// RUN: %clang -target x86_64-pc-win32 -fsanitize-coverage=bb %s -### 2>&1 | FileCheck %s --check-prefix=CHECK-COVERAGE-WIN64
	// CHECK-COVERAGE-WIN64: "--dependent-lib={{[^"]*}}ubsan_standalone-x86_64.lib"

	// RUN: %clang -target x86_64-linux-gnu -fsanitize=integer %s -### 2>&1 | FileCheck %s --check-prefix=CHECK-INTEGER -implicit-check-not="-fsanitize-address-use-after-scope"
	// CHECK-INTEGER: "-fsanitize={{((signed-integer-overflow|unsigned-integer-overflow|integer-divide-by-zero|shift-base|shift-exponent),?){5}"}}			// CHECK-INTEGER: "-fsanitize={{((signed-integer-overflow|unsigned-integer-overflow|integer-divide-by-zero|shift-base|shift-exponent|implicit-integer-truncation),?){6}"}}

				// RUN: %clang -target x86_64-linux-gnu -fsanitize=implicit-conversion %s -### 2>&1 | FileCheck %s --check-prefixes=CHECK-implicit-conversion,CHECK-implicit-conversion-RECOVER
				// RUN: %clang -target x86_64-linux-gnu -fsanitize=implicit-conversion -fsanitize-recover=implicit-conversion %s -### 2>&1 | FileCheck %s --check-prefixes=CHECK-implicit-conversion,CHECK-implicit-conversion-RECOVER
				// RUN: %clang -target x86_64-linux-gnu -fsanitize=implicit-conversion -fno-sanitize-recover=implicit-conversion %s -### 2>&1 | FileCheck %s --check-prefixes=CHECK-implicit-conversion,CHECK-implicit-conversion-NORECOVER
				// RUN: %clang -target x86_64-linux-gnu -fsanitize=implicit-conversion -fsanitize-trap=implicit-conversion %s -### 2>&1 | FileCheck %s --check-prefixes=CHECK-implicit-conversion,CHECK-implicit-conversion-TRAP
				// CHECK-implicit-conversion: "-fsanitize={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-RECOVER: "-fsanitize-recover={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-RECOVER-NOT: "-fno-sanitize-recover={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-RECOVER-NOT: "-fsanitize-trap={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-NORECOVER-NOT: "-fno-sanitize-recover={{((implicit-integer-truncation),?){1}"}} // ???
				// CHECK-implicit-conversion-NORECOVER-NOT: "-fsanitize-recover={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-NORECOVER-NOT: "-fsanitize-trap={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-TRAP: "-fsanitize-trap={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-TRAP-NOT: "-fsanitize-recover={{((implicit-integer-truncation),?){1}"}}
				// CHECK-implicit-conversion-TRAP-NOT: "-fno-sanitize-recover={{((implicit-integer-truncation),?){1}"}}

	// RUN: %clang -fsanitize=bounds -### -fsyntax-only %s 2>&1 | FileCheck %s --check-prefix=CHECK-BOUNDS
	// CHECK-BOUNDS: "-fsanitize={{((array-bounds|local-bounds),?){2}"}}

	// RUN: %clang -target x86_64-linux-gnu -fsanitize=all %s -### 2>&1 | FileCheck %s --check-prefix=CHECK-FSANITIZE-ALL
	// CHECK-FSANITIZE-ALL: error: unsupported argument 'all' to option 'fsanitize='

	// RUN: %clang -target x86_64-linux-gnu -fsanitize=address,undefined -fno-sanitize=all -fsanitize=thread %s -### 2>&1 | FileCheck %s --check-prefix=CHECK-FNO-SANITIZE-ALL


Diff 158035
