This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/
-
clang/
-
AST/
2
Redeclarable.h
-
StaticAnalyzer/Core/PathSensitive/
-
Core/
-
PathSensitive/
3
MemRegion.h
-
lib/
-
AST/
2
ASTImporter.cpp
3
ExprConstant.cpp
-
Interp/
7
ByteCodeExprGen.cpp
-
Analysis/
1
CFG.cpp
-
Basic/
2
SourceManager.cpp
-
Targets/
-
NVPTX.cpp
-
CodeGen/
-
CGHLSLRuntime.cpp
-
CodeGenModule.cpp
-
Driver/
-
Multilib.cpp
-
ToolChains/
-
Flang.cpp
-
Format/
-
Format.cpp
-
Frontend/
-
Rewrite/
-
RewriteModernObjC.cpp
-
RewriteObjC.cpp
-
SARIFDiagnostic.cpp
-
Lex/
-
PPDirectives.cpp
-
PreprocessingRecord.cpp
-
Parse/
-
ParseDecl.cpp
-
Sema/
-
SemaChecking.cpp
-
SemaCodeComplete.cpp
-
Serialization/
-
ASTReader.cpp
-
ASTWriter.cpp
-
StaticAnalyzer/Core/
-
Core/
-
CoreEngine.cpp
-
SVals.cpp
-
tools/
-
clang-refactor/
-
TestSupport.cpp
-
libclang/
1
CIndex.cpp
1
CXCursor.cpp
-
utils/TableGen/
-
TableGen/
1
ClangSyntaxEmitter.cpp

Differential D135551

[clang] replace `assert(0)` with `llvm_unreachable` NFC
Needs ReviewPublic

Authored by inclyc on Oct 9 2022, 8:29 PM.

Download Raw Diff

Details

Reviewers

aaron.ballman
fhahn
dexonsmith
shafik
sscalpone
NoQ

Summary

The comment where llvm_unreachable says

Marks that the current location is not supposed to be reachable.
In !NDEBUG builds, prints the message and location info to stderr.
In NDEBUG builds, if the platform does not support a builtin unreachable
then we call an internal LLVM runtime function. Otherwise the behavior is
controlled by the CMake flag
  -DLLVM_UNREACHABLE_OPTIMIZE
* When "ON" (default) llvm_unreachable() becomes an optimizer hint
  that the current location is not supposed to be reachable: the hint
  turns such code path into undefined behavior.  On compilers that don't
  support such hints, prints a reduced message instead and aborts the
  program.
* When "OFF", a builtin_trap is emitted instead of an
  optimizer hint or printing a reduced message.

Use this instead of assert(0). It conveys intent more clearly, suppresses
diagnostics for unreachable code paths, and allows compilers to omit
unnecessary code.

We have discussions on the discourse here: https://discourse.llvm.org/t/llvm-unreachable-is-widely-misused/60587/8

Link: https://github.com/llvm/llvm-project/blob/50312ea133999cb2aad1ab9ef0ec39429a9427c5/llvm/include/llvm/Support/ErrorHandling.h#L125
Link: https://discourse.llvm.org/t/llvm-unreachable-is-widely-misused/60587/8

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

inclyc created this revision.Oct 9 2022, 8:29 PM

Herald added a reviewer: shafik. · View Herald TranscriptOct 9 2022, 8:29 PM

Herald added a reviewer: sscalpone. · View Herald Transcript

Herald added a reviewer: NoQ. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: steakhal, mattd, gchakrabarti and 3 others. · View Herald Transcript

inclyc requested review of this revision.Oct 9 2022, 8:29 PM

Herald added subscribers: MaskRay, jholewinski. · View Herald TranscriptOct 9 2022, 8:29 PM

Harbormaster completed remote builds in B191207: Diff 466416.Oct 9 2022, 9:05 PM

liaolucy added a subscriber: liaolucy.Oct 9 2022, 11:44 PM

TODO: Fix CI

Fix CI issue

Herald added a project: Restricted Project. · View Herald TranscriptOct 10 2022, 4:52 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Harbormaster completed remote builds in B191249: Diff 466471.Oct 10 2022, 5:46 AM

I don't know if that discussion reached a conclusion to move forward with this change -- my reading of the conversation was that efforts would be better spend on fuzzing instead of changing policy about using unreachable vs assert(0).

In general, I'm a bit hesitant to make this change. On the one hand, it's logically no worse than using assert(0) in a release build (if you hit this code path, bad things are going to happen). But __builtin_unreachable can have time travel optimization effects that assert doesn't have, and so the kind of bad things which can happen are different between the two (and use of unreachable on reachable code paths might make for harder debugging in RelWithDebInfo builds). Historically, we've usually used llvm_unreachable for situations where we're saying "this code cannot be reached; if it can, something else has gone seriously wrong." For example, in code like: int foo(SomeEnum E) { switch (E) { case One: return 1; default: return 2; } llvm_unreachable("getting here would be mysterious"); } and we've used assert(0) for situations where we're saying "this code is possible to reach only when there were mistakes elsewhere which broke invariants we were relying on." The two situations are similar, but still different enough that I don't think we should wholesale change from one form to another.

In D135551#3846592, @aaron.ballman wrote:

I don't know if that discussion reached a conclusion to move forward with this change -- my reading of the conversation was that efforts would be better spend on fuzzing instead of changing policy about using unreachable vs assert(0).

In general, I'm a bit hesitant to make this change. On the one hand, it's logically no worse than using assert(0) in a release build (if you hit this code path, bad things are going to happen). But __builtin_unreachable can have time travel optimization effects that assert doesn't have, and so the kind of bad things which can happen are different between the two (and use of unreachable on reachable code paths might make for harder debugging in RelWithDebInfo builds). Historically, we've usually used llvm_unreachable for situations where we're saying "this code cannot be reached; if it can, something else has gone seriously wrong." For example, in code like: int foo(SomeEnum E) { switch (E) { case One: return 1; default: return 2; } llvm_unreachable("getting here would be mysterious"); } and we've used assert(0) for situations where we're saying "this code is possible to reach only when there were mistakes elsewhere which broke invariants we were relying on." The two situations are similar, but still different enough that I don't think we should wholesale change from one form to another.

but still different enough that I don't think we should wholesale change from one form to another.

In general we can control the behavior here via -DLLVM_UNREACHABLE_OPTIMIZE to choose making assumptions or traps (looks better than assertions to me).

https://github.com/llvm/llvm-project/blob/50312ea133999cb2aad1ab9ef0ec39429a9427c5/llvm/include/llvm/Support/ErrorHandling.h#L125

(This change was landed 7 months ago https://reviews.llvm.org/D121750)

In D135551#3846603, @inclyc wrote:

In D135551#3846592, @aaron.ballman wrote:

I don't know if that discussion reached a conclusion to move forward with this change -- my reading of the conversation was that efforts would be better spend on fuzzing instead of changing policy about using unreachable vs assert(0).

In general, I'm a bit hesitant to make this change. On the one hand, it's logically no worse than using assert(0) in a release build (if you hit this code path, bad things are going to happen). But __builtin_unreachable can have time travel optimization effects that assert doesn't have, and so the kind of bad things which can happen are different between the two (and use of unreachable on reachable code paths might make for harder debugging in RelWithDebInfo builds). Historically, we've usually used llvm_unreachable for situations where we're saying "this code cannot be reached; if it can, something else has gone seriously wrong." For example, in code like: int foo(SomeEnum E) { switch (E) { case One: return 1; default: return 2; } llvm_unreachable("getting here would be mysterious"); } and we've used assert(0) for situations where we're saying "this code is possible to reach only when there were mistakes elsewhere which broke invariants we were relying on." The two situations are similar, but still different enough that I don't think we should wholesale change from one form to another.

but still different enough that I don't think we should wholesale change from one form to another.

In general we can control the behavior here via -DLLVM_UNREACHABLE_OPTIMIZE to choose making assumptions or traps (looks better than assertions to me).

https://github.com/llvm/llvm-project/blob/50312ea133999cb2aad1ab9ef0ec39429a9427c5/llvm/include/llvm/Support/ErrorHandling.h#L125

(This change was landed 7 months ago https://reviews.llvm.org/D121750)

That doesn't change the underlying concern that assert(0) and unreachable are used for different purposes and trying to unify those use cases might lose some expressivity in the code base.

I've left some comments in the review about examples of my concerns (it's not an exhaustive review).

clang/tools/libclang/CIndex.cpp
5191	This one is a bit questionable -- this is part of the C interface we expose, which is ABI stable, so the assert was alerting users to potential mismatches between versions of the library.
clang/tools/libclang/CXCursor.cpp
1490	Each of these is actually reachable -- the asserts exist specifically to tell users of the C interface about problems with their assumptions. In each of these cases, the assert is avoiding the need for a local variable to assert on.
clang/utils/TableGen/ClangSyntaxEmitter.cpp
120	This should not be using unreachable -- the code is very much reachable. This should have changed from `assert` to `PrintFatalError`.

Address comments

I've left some comments in the review about examples of my concerns (it's not an exhaustive review).

Thanks @aaron.ballman ! I didn't quite understand the original meaning of this code here (e.g. libclang), and I have now removed the relevant changes. I think this patch should replace the code that accidentally misuses of assert(0) with llvm_unreachable().

Harbormaster completed remote builds in B191308: Diff 466539.Oct 10 2022, 10:52 AM

dblaikie added a subscriber: lhames.Oct 10 2022, 11:16 AM

dblaikie added a subscriber: dblaikie.

I thought this was settled quite a while ago and enshrined in the style guide: https://llvm.org/docs/CodingStandards.html#assert-liberally

assert(0) should not be used if something is reachable. We shouldn't have a "this violates an invariant, but if you don't have asserts enabled you do get some maybe-acceptable behavior".

I feel fairly strongly that any cases of "reachable asserts" should be changed to valid error handling or llvm_report_error and remaining assert(0) should be transformed to llvm_unreachable. (also, ideally, don't use branch-to-unreachable where possible, instead assert the condition - in cases where the if has side effects, sometimes that's the nicest way to write it, but might be clearer, if more verbose to use a local variable for the condition, then assert that the variable is true (and have the requisite "variable might be unused" cast))

Historically, we've usually used llvm_unreachable for situations where we're saying "this code cannot be reached; if it can, something else has gone seriously wrong." For example, in code like: int foo(SomeEnum E) { switch (E) { case One: return 1; default: return 2; } llvm_unreachable("getting here would be mysterious"); } and we've used assert(0) for situations where we're saying "this code is possible to reach only when there were mistakes elsewhere which broke invariants we were relying on."

I don't think those are different things though - violating invariants is ~= something going seriously wrong.

(sorry, I guess I should debate this on the thread instead of here - but I think most of the folks on that thread did agree with the LLVM style guide/the direction here)

This, I think was also discussed about a decade ago in the LLVM community and resulted in r166821/2962d9599e463265edae599285bbc6351f1cc0ef which specifically "Suggests llvm_unreachable over assert(0)" and is the policy of LLVM - this change is consistent with that policy.

Can't seem to find an llvm-dev/commits discussion for r166821, but I remember discussing it several times before, perhaps this one happened on IRC and so we may not have any record of it.

In D135551#3847444, @dblaikie wrote:

I thought this was settled quite a while ago and enshrined in the style guide: https://llvm.org/docs/CodingStandards.html#assert-liberally

assert(0) should not be used if something is reachable. We shouldn't have a "this violates an invariant, but if you don't have asserts enabled you do get some maybe-acceptable behavior".

I feel fairly strongly that any cases of "reachable asserts" should be changed to valid error handling or llvm_report_error and remaining assert(0) should be transformed to llvm_unreachable. (also, ideally, don't use branch-to-unreachable where possible, instead assert the condition - in cases where the if has side effects, sometimes that's the nicest way to write it, but might be clearer, if more verbose to use a local variable for the condition, then assert that the variable is true (and have the requisite "variable might be unused" cast))

I would be okay with that, but that's not what this patch was doing -- it was changing assert(0) into an llvm_unreachable more mechanically, and I don't think that's a valid transformation. The key, to me, is not losing the distinction between "reaching here is a programming mistake that you'd make during active development" vs "we never expect to reach this patch and want to optimize accordingly." __builtin_unreachable changes the debugging landscape far too much for me to want to see folks using it for "reaching here is a programming mistake" situations, *especially* in RelWithDebInfo builds where optimizations are enabled and may result in surprising call stacks and time travel debugging.

Historically, we've usually used llvm_unreachable for situations where we're saying "this code cannot be reached; if it can, something else has gone seriously wrong." For example, in code like: int foo(SomeEnum E) { switch (E) { case One: return 1; default: return 2; } llvm_unreachable("getting here would be mysterious"); } and we've used assert(0) for situations where we're saying "this code is possible to reach only when there were mistakes elsewhere which broke invariants we were relying on."

I don't think those are different things though - violating invariants is ~= something going seriously wrong.

(sorry, I guess I should debate this on the thread instead of here - but I think most of the folks on that thread did agree with the LLVM style guide/the direction here)

This, I think was also discussed about a decade ago in the LLVM community and resulted in r166821/2962d9599e463265edae599285bbc6351f1cc0ef which specifically "Suggests llvm_unreachable over assert(0)" and is the policy of LLVM - this change is consistent with that policy.

I don't have the context for those changes in my email either, but regardless of what we thought ten years ago, we have code in the code base today that assumes a difference in severity between kinds of unreachable statements so we need to be careful when correcting mistakes. I think we're still in agreement that llvm_unreachable should be preferred over assert(0) in situations where the code is expected to be impossible to reach. I think we're also still in agreement that "correct error reporting" is preferred over assert(0). Where we might still have daylight are the occasional situations where assert(0) gives a better experience -- when the code is possible to reach but reaching it signifies a developer (not user) mistake that is plausible to make when doing new development, such as when misusing the C interface (where there isn't always an error that should be reported via the API). I think these uses should generally be rare, but I don't think it's a "never do this" kind of situation.

llvm_unreachable asserts in debug mode, so it has the nice property we need when doing new development. But it doesn't convey the distinction in semantics. Maybe another approach is: #define developer_bugcheck(msg) llvm_unreachable(msg) (or something along those lines). We still get the assertion in debug mode, we then get the better optimization in release mode, but we don't lose the semantic information in the code base. It doesn't really help the RelWithDebInfo case though (where call stacks may be unrecognizable as a result of time travel optimizations) but maybe that's a tradeoff worth making?

I would think we could convert every assert(0) to either llvm::report_fatal_error (guaranteed trap) or llvm_unreachable() (trap or optimize, depending on CMAKE configuration). The C API usage checks seem like good candidates for the former.

Also, not sure if everyone noticed, but the latter can now be configured to always trap by turning off the “optimize” CMAKE flag. This seems useful for fuzzing situations where you may not want asserts builds.

In D135551#3847607, @aaron.ballman wrote:

In D135551#3847444, @dblaikie wrote:

I thought this was settled quite a while ago and enshrined in the style guide: https://llvm.org/docs/CodingStandards.html#assert-liberally

assert(0) should not be used if something is reachable. We shouldn't have a "this violates an invariant, but if you don't have asserts enabled you do get some maybe-acceptable behavior".

I feel fairly strongly that any cases of "reachable asserts" should be changed to valid error handling or llvm_report_error and remaining assert(0) should be transformed to llvm_unreachable. (also, ideally, don't use branch-to-unreachable where possible, instead assert the condition - in cases where the if has side effects, sometimes that's the nicest way to write it, but might be clearer, if more verbose to use a local variable for the condition, then assert that the variable is true (and have the requisite "variable might be unused" cast))

I would be okay with that, but that's not what this patch was doing -- it was changing assert(0) into an llvm_unreachable more mechanically, and I don't think that's a valid transformation. The key, to me, is not losing the distinction between "reaching here is a programming mistake that you'd make during active development" vs "we never expect to reach this patch and want to optimize accordingly."

I don't really think those are different things, though. Violating invariants is UB and there's no discussion to be had about how the program (in a non-asserts build) behaves when those invariants are violated - all bets are off, whether it's assert or unreachable.

__builtin_unreachable changes the debugging landscape far too much for me to want to see folks using it for "reaching here is a programming mistake" situations, *especially* in RelWithDebInfo builds where optimizations are enabled and may result in surprising call stacks and time travel debugging.

Generally LLVM's pretty hard to fathom in a non-asserts build anyway, right? (that's the first thing any of us do is reproduce with an assertions build that may fail miles away from where a crash occurred because an invariant was violated much earlier) - that cast won't crash/will continue on happily in a non-asserts build seems like a much larger hole to debuggability of a non-asserts build than any unreachable?

Historically, we've usually used llvm_unreachable for situations where we're saying "this code cannot be reached; if it can, something else has gone seriously wrong." For example, in code like: int foo(SomeEnum E) { switch (E) { case One: return 1; default: return 2; } llvm_unreachable("getting here would be mysterious"); } and we've used assert(0) for situations where we're saying "this code is possible to reach only when there were mistakes elsewhere which broke invariants we were relying on."

I don't think those are different things though - violating invariants is ~= something going seriously wrong.

(sorry, I guess I should debate this on the thread instead of here - but I think most of the folks on that thread did agree with the LLVM style guide/the direction here)

This, I think was also discussed about a decade ago in the LLVM community and resulted in r166821/2962d9599e463265edae599285bbc6351f1cc0ef which specifically "Suggests llvm_unreachable over assert(0)" and is the policy of LLVM - this change is consistent with that policy.

I don't have the context for those changes in my email either, but regardless of what we thought ten years ago, we have code in the code base today that assumes a difference in severity between kinds of unreachable statements so we need to be careful when correcting mistakes. I think we're still in agreement that llvm_unreachable should be preferred over assert(0) in situations where the code is expected to be impossible to reach. I think we're also still in agreement that "correct error reporting" is preferred over assert(0). Where we might still have daylight are the occasional situations where assert(0) gives a better experience -- when the code is possible to reach but reaching it signifies a developer (not user) mistake that is plausible to make when doing new development, such as when misusing the C interface (where there isn't always an error that should be reported via the API). I think these uses should generally be rare, but I don't think it's a "never do this" kind of situation.

That still seems like something that should be caught by an asserts build and is UB otherwise. If you're developing against LLVM's APIs in a non-assertions build, there's a lot of other invariants that won't be checked (cast being a pretty core example) and will make debugging really difficult.

llvm_unreachable asserts in debug mode, so it has the nice property we need when doing new development. But it doesn't convey the distinction in semantics. Maybe another approach is: #define developer_bugcheck(msg) llvm_unreachable(msg) (or something along those lines). We still get the assertion in debug mode, we then get the better optimization in release mode, but we don't lose the semantic information in the code base. It doesn't really help the RelWithDebInfo case though (where call stacks may be unrecognizable as a result of time travel optimizations) but maybe that's a tradeoff worth making?

I don't really see the distinction, though - it's a violated invariant, which is a developer bug, whether it's from an assert on unreachable. They're both expressing invariants - not of different strength. "Valid code should never reach this branch" and "valid code should never produce a false result here".

I'd really like to avoid what I see as cleanup - replacing one construct (assert(false)) with another (llvm_unreachable) of identical (to me) contractual strength - being slowed because of this distinction. Replacing reachable-unreachables, or reachable-false-assertions with llvm_report_error, to me, is orthogonal to this cleanup.

(& for the C API - I think assertions/unreachable are the right thing, not llvm_report_error - I'm sure there's many more ways to violate the C API contract that would result in inexplicable/confusing behavior and that will only ever be protected by assertions, than we might cover by a few llvm_report_errors on the interface & that should be the direction we encourage people to go down - develop with assertions enabled)

Basically llvm_report_error ends up/should be a "we couldn't find a better way to do error handling here, but it is really a reachable error we have to do something with" - each one is a bug in the library-ness of LLVM. (or it's used up in a tool implementation (above the library layer) and then it's fine/just a convenient way to error/exit, though probably still isn't so good for diagnostic quality from such tools). If it's not reachable by a correct usage (be it from a command line tool or API) then it should be an assertion, not an llvm_report_error.

But I know this issue comes up from time to time and I've yet to figure out a way to come to shared understandings on it :/ so I don't expect my rather narrow and firm definition to necessarily carry the day on this.

Because I was looking, some other assert(0) -> llvm_unreachable cleanups (though, yes, even the earliest cleanups include some assert(0) -> report_fatal_error, but for externally/user-reachable failures, like invalid bitcode, I think). Some of these are more blanket/wide reaching than others, for sure. (& no doubt the naive search through the commit log doesn't find all the cleanups)

I /think/ when I was looking I did find one from 2017 that Richard Smith approved that included llvm_unreachable in the Clang C API, in terms of more recent precedent for some of this... but can't find that again right now.

rGabd1561f15e:[LLDBAssert] Use unreachable instead of assert(0)
rGf9da10ce4544ea66fe6fad5b943d3700d192a1e1:Change to assert(0,x) to llvm_unreachable(x)
rG7a247f709baa2cd19111af9e18965df4e419949a:Turn effective assert(0) into llvm_unreachable
rG35b2f75733c98e5904c5a75f8bcedeb96c4f4eda:Convert some assert(0) to llvm_unreachable or fold an 'if' condition into the assert.
rGd8d43191d8afe2c4b5b2d3be62cd73ddc3ddc6c9:Replace some assert(0)'s with llvm_unreachable.
rG2a30d7889fc54c8a74d73b79be3dd030bac41b06:Replace some assert(0)'s with llvm_unreachable.
rG0039f3f0607702f2d16d60addff74c67869e2144:Replace some assert(0)'s with llvm_unreachable.
rGc7193c48d9d7e44e9fd0c39205e8b7cfbf5d5458:Convert assert(0) to llvm_unreachable to silence a warning about Addend being uninitialized in default case.
rG7b7a67c5c8daa051e42837dc3e5e65adab9cf09c:[ARM64] Fix 'assert("...")' to be 'assert(0 && "...")'. Otherwise, it is no assert at all. ;] Some of these should probably be switched to llvm_unreachable, but I didn't want to perturb the behavior in this patch.
rGeaa3a7efab65d0a65ddda7fdb7e5fbdbd5f897ad:Use llvm_unreachable instead of assert(0)
rGd7fd95a5c1eb19e5754281b540f09e86ced1b9d4:Change assert(0 && "text") to llvm_unreachable(0 && "text")
rG2962d9599e463265edae599285bbc6351f1cc0ef:Suggest llvm_unreachable over assert(0).
rG2e007de42de48dc05bdb7aa9d3c9e8902ee720fe:Revert "Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0)."
rGdc4261794fdbf2e3001f369d8b8bbd77eb923602:Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0).
rG751eb3d2b30d90069b1797a31f8e489f0763c585:use llvm_unreachable() instead of assert(0) for invalid enum values in switch statements
rGbdf39a46a302df7cb07e948a6780265b3335fbda:Convert assert(0) to llvm_unreachable.
rGeb455832b4c7cb1046ab9fb9f6f8ca40202874a4:Silence various build warnings from Hexagon backend that show up in release builds. Mostly converting 'assert(0)' to 'llvm_unreachable' to silence warnings about missing returns. Also fold some variable declarations into asserts to prevent the variables from being unused in release builds.
rGabd1561f15ee466c0fd9abeede2cdcde2ebb2cec:[LLDBAssert] Use unreachable instead of assert(0)
rGf9da10ce4544ea66fe6fad5b943d3700d192a1e1:Change to assert(0,x) to llvm_unreachable(x)
rG7a247f709baa2cd19111af9e18965df4e419949a:Turn effective assert(0) into llvm_unreachable
rG35b2f75733c98e5904c5a75f8bcedeb96c4f4eda:Convert some assert(0) to llvm_unreachable or fold an 'if' condition into the assert.
rGd8d43191d8afe2c4b5b2d3be62cd73ddc3ddc6c9:Replace some assert(0)'s with llvm_unreachable.
rG2a30d7889fc54c8a74d73b79be3dd030bac41b06:Replace some assert(0)'s with llvm_unreachable.
rG0039f3f0607702f2d16d60addff74c67869e2144:Replace some assert(0)'s with llvm_unreachable.
rGc7193c48d9d7e44e9fd0c39205e8b7cfbf5d5458:Convert assert(0) to llvm_unreachable to silence a warning about Addend being uninitialized in default case.
rG7b7a67c5c8daa051e42837dc3e5e65adab9cf09c:[ARM64] Fix 'assert("...")' to be 'assert(0 && "...")'. Otherwise, it is no assert at all. ;] Some of these should probably be switched to llvm_unreachable, but I didn't want to perturb the behavior in this patch.
rGeaa3a7efab65d0a65ddda7fdb7e5fbdbd5f897ad:Use llvm_unreachable instead of assert(0)
rGd7fd95a5c1eb19e5754281b540f09e86ced1b9d4:Change assert(0 && "text") to llvm_unreachable(0 && "text")
rG2962d9599e463265edae599285bbc6351f1cc0ef:Suggest llvm_unreachable over assert(0).
rG2e007de42de48dc05bdb7aa9d3c9e8902ee720fe:Revert "Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0)."
rGdc4261794fdbf2e3001f369d8b8bbd77eb923602:Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0).
rG751eb3d2b30d90069b1797a31f8e489f0763c585:use llvm_unreachable() instead of assert(0) for invalid enum values in switch statements
rGbdf39a46a302df7cb07e948a6780265b3335fbda:Convert assert(0) to llvm_unreachable.
rGeb455832b4c7cb1046ab9fb9f6f8ca40202874a4:Silence various build warnings from Hexagon backend that show up in release builds. Mostly converting 'assert(0)' to 'llvm_unreachable' to silence warnings about missing returns. Also fold some variable declarations into asserts to prevent the variables from being unused in release builds.
rG8619c37b5b8795fcc722373f5fd9a5d0c07195af:Replace assert(0) with llvm_unreachable to avoid warnings about dropping off the end of a non-void function in Release builds.
rGa2886c21d9a08d63c324cc61aa91ae0893507a31:Convert assert(0) to llvm_unreachable
rGe55c556a247a9c0decb4e256d9e897dfc9cf841d:Convert assert(0) to llvm_unreachable
rGc514b5474a3feb6b4b2775b75c4f8eaf0676a9d0:Convert assert(0) to llvm_unreachable
rGee4dab5f1f7a0c32167b8b91c5733e77d4d88dcc:Convert assert(0) to llvm_unreachable
rGc4965bce14249ad13bd532af827bdd5c21b340fd:Convert assert(0) to llvm_unreachable
rG4ed7278ff4ef206d01867fda76bf90df36398c4c:Convert assert(0) to llvm_unreachable in X86 Target directory.
rG83f3bdaa457e7ac28fdda724808be8fd4f1d275d:Convert some assert(0) in default of switch statements to llvm_unreachable.
rG83d382b1cad133cb163a68dd7149fae2802275e1:Switch assert(0/false) llvm_unreachable.
rGea431722972fa481bbe2898d180678a08f977fa0:Prefer llvm_unreachable to assert(0)
rG1ab40bef8dfe416cc82455eea39d01d496752d90:After converting assert(0) to LLVM_UNREACHABLE we lost file/line location. Fix by making the LLVM_UNREACHABLE pass FILE and LINE to llvm_unreachable.
rG56d065972602c45a4109617f32eb8605e5017c5e:assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds.
rG6dd2730024c20647b92ddfbb80e9a8bf33308ccd:Start converting to new error handling API. cerr+abort -> llvm_report_error assert(0)+abort -> LLVM_UNREACHABLE (assert(0)+llvm_unreachable-> abort() included)

In D135551#3847962, @dblaikie wrote:

In D135551#3847607, @aaron.ballman wrote:

In D135551#3847444, @dblaikie wrote:

I thought this was settled quite a while ago and enshrined in the style guide: https://llvm.org/docs/CodingStandards.html#assert-liberally

assert(0) should not be used if something is reachable. We shouldn't have a "this violates an invariant, but if you don't have asserts enabled you do get some maybe-acceptable behavior".

I feel fairly strongly that any cases of "reachable asserts" should be changed to valid error handling or llvm_report_error and remaining assert(0) should be transformed to llvm_unreachable. (also, ideally, don't use branch-to-unreachable where possible, instead assert the condition - in cases where the if has side effects, sometimes that's the nicest way to write it, but might be clearer, if more verbose to use a local variable for the condition, then assert that the variable is true (and have the requisite "variable might be unused" cast))

I would be okay with that, but that's not what this patch was doing -- it was changing assert(0) into an llvm_unreachable more mechanically, and I don't think that's a valid transformation. The key, to me, is not losing the distinction between "reaching here is a programming mistake that you'd make during active development" vs "we never expect to reach this patch and want to optimize accordingly."

I don't really think those are different things, though. Violating invariants is UB and there's no discussion to be had about how the program (in a non-asserts build) behaves when those invariants are violated - all bets are off, whether it's assert or unreachable.

I think they're the same thing in terms of runtime behavior, but I feel (rather strongly) that they're different in terms of documentation when reading the source. This code pattern exists and keeps coming up year after year, which is sufficient to inform me that the community thinks there is *some* distinction to be made there. Also, the fact that we have report_fatal_error *and* unreachable APIs signals that we already understand there's a distinction between "reaching here will never happen; optimize accordingly" and "reaching here is a surprising mistake".

__builtin_unreachable changes the debugging landscape far too much for me to want to see folks using it for "reaching here is a programming mistake" situations, *especially* in RelWithDebInfo builds where optimizations are enabled and may result in surprising call stacks and time travel debugging.

Generally LLVM's pretty hard to fathom in a non-asserts build anyway, right? (that's the first thing any of us do is reproduce with an assertions build that may fail miles away from where a crash occurred because an invariant was violated much earlier) - that cast won't crash/will continue on happily in a non-asserts build seems like a much larger hole to debuggability of a non-asserts build than any unreachable?

This might be true -- personally, I tend to only use debug builds with MSVC because RelWithDebInfo isn't sufficient for my daily needs. However, I've definitely heard of folks who use RelWithDebInfo for their daily work (RelWithDebInfo + Asserts specifically, IIRC) because of the improved build times and runtime performance; we should be sure we're not disrupting that workflow too much.

Historically, we've usually used llvm_unreachable for situations where we're saying "this code cannot be reached; if it can, something else has gone seriously wrong." For example, in code like: int foo(SomeEnum E) { switch (E) { case One: return 1; default: return 2; } llvm_unreachable("getting here would be mysterious"); } and we've used assert(0) for situations where we're saying "this code is possible to reach only when there were mistakes elsewhere which broke invariants we were relying on."

I don't think those are different things though - violating invariants is ~= something going seriously wrong.

(sorry, I guess I should debate this on the thread instead of here - but I think most of the folks on that thread did agree with the LLVM style guide/the direction here)

This, I think was also discussed about a decade ago in the LLVM community and resulted in r166821/2962d9599e463265edae599285bbc6351f1cc0ef which specifically "Suggests llvm_unreachable over assert(0)" and is the policy of LLVM - this change is consistent with that policy.

I don't have the context for those changes in my email either, but regardless of what we thought ten years ago, we have code in the code base today that assumes a difference in severity between kinds of unreachable statements so we need to be careful when correcting mistakes. I think we're still in agreement that llvm_unreachable should be preferred over assert(0) in situations where the code is expected to be impossible to reach. I think we're also still in agreement that "correct error reporting" is preferred over assert(0). Where we might still have daylight are the occasional situations where assert(0) gives a better experience -- when the code is possible to reach but reaching it signifies a developer (not user) mistake that is plausible to make when doing new development, such as when misusing the C interface (where there isn't always an error that should be reported via the API). I think these uses should generally be rare, but I don't think it's a "never do this" kind of situation.

That still seems like something that should be caught by an asserts build and is UB otherwise. If you're developing against LLVM's APIs in a non-assertions build, there's a lot of other invariants that won't be checked (cast being a pretty core example) and will make debugging really difficult.

llvm_unreachable asserts in debug mode, so it has the nice property we need when doing new development. But it doesn't convey the distinction in semantics. Maybe another approach is: #define developer_bugcheck(msg) llvm_unreachable(msg) (or something along those lines). We still get the assertion in debug mode, we then get the better optimization in release mode, but we don't lose the semantic information in the code base. It doesn't really help the RelWithDebInfo case though (where call stacks may be unrecognizable as a result of time travel optimizations) but maybe that's a tradeoff worth making?

I don't really see the distinction, though - it's a violated invariant, which is a developer bug, whether it's from an assert on unreachable. They're both expressing invariants - not of different strength. "Valid code should never reach this branch" and "valid code should never produce a false result here".

There are different kinds of developer bugs and it's reasonable for people to want to express them differently, IMO. Concretely with a contrived example:

enum E { Zero, One, Two };

int func(E Val) {
  switch (Val) {
  case Zero: return 12;
  case One: return 200;
  case Two: return -1;
  }
  llvm_unreachable("never get here");
}

This is a case where the code is technically reachable (someone can call func((E)3)) but the programmer's intent it "nobody will ever do that" because it's an internal API, not exposed to C, other safeguards like diagnostics ensure it, or whatever. Contrast this with:

enum E { Zero, One, Two };

extern "C" int func(unsigned Val) {
  switch (Val) {
  case Zero: return 12;
  case One: return 200;
  case Two: return -1;
  }
  assert(0 && "never get here");
}

Same general situation, but now the safeguards are no longer in place. The function is externally callable by anyone who wants to use it, the interface has lost the connection to the enumeration, etc. So the code is also technically reachable, but the author wants to signal the difference in plausibility of reaching the erroneous state.

I definitely agree that the end results of reaching the unreachable bits are the same in terms of what happens at runtime. But I'm worried about maintaining the code -- losing the distinction between the two situations makes the person debugging the failure have to figure out whether someone missed adding a safeguard elsewhere vs something more difficult to solve such as a miscompile from the host implementation, stack stomping, etc.

I don't care whether we spell it assert(0) or report_fatal_error() or something else. I care that we don't have a policy requiring folks to not make a distinction between the two scenarios aside from leaving comments in the code as I think that's a step backwards.

I'd really like to avoid what I see as cleanup - replacing one construct (assert(false)) with another (llvm_unreachable) of identical (to me) contractual strength - being slowed because of this distinction. Replacing reachable-unreachables, or reachable-false-assertions with llvm_report_error, to me, is orthogonal to this cleanup.

Understood. I'm pushing back on this being a cleanup in all cases.

(& for the C API - I think assertions/unreachable are the right thing, not llvm_report_error - I'm sure there's many more ways to violate the C API contract that would result in inexplicable/confusing behavior and that will only ever be protected by assertions, than we might cover by a few llvm_report_errors on the interface & that should be the direction we encourage people to go down - develop with assertions enabled)

Basically llvm_report_error ends up/should be a "we couldn't find a better way to do error handling here, but it is really a reachable error we have to do something with" - each one is a bug in the library-ness of LLVM. (or it's used up in a tool implementation (above the library layer) and then it's fine/just a convenient way to error/exit, though probably still isn't so good for diagnostic quality from such tools). If it's not reachable by a correct usage (be it from a command line tool or API) then it should be an assertion, not an llvm_report_error.

But I know this issue comes up from time to time and I've yet to figure out a way to come to shared understandings on it :/ so I don't expect my rather narrow and firm definition to necessarily carry the day on this.

Concrete guidance around reachability that I would be happy with is:

Prefer a user-facing diagnostic in any circumstance that's under the user's direct control (through command line options, source code, etc); users should not get failed assertions/crashes as a form of error reporting.
Prefer assert(condition) in any circumstance under which there is a condition to be checked to validate the state of the system. The only use of a literal in such an assertion should be a string literal to use as a message. e.g., no assert(0);
Prefer llvm_report_error() in any circumstance under which a code path is functionally possible to reach, but only in erroneous executions that signify a mistake on the part of the LLVM developer elsewhere in the program.
Prefer llvm_unreachable() in any circumstance under which a code path is believed to be functionally impossible to reach (even if technically possible to reach). The API is now self-documenting to mean "this code really should be totally unreachable".

Something is functionally possible to reach when there are missing safeguards elsewhere that should have prevented calling the function in that state. Something is functionally impossible to reach when miscompiles, undefined behavior, or malicious users modifying memory with a debugger (etc) are the realistic ways to reach that state. When in doubt as to whether something is functionally possible to reach or not, use your best judgement but prefer llvm_report_error on the assumption that most reachability mistakes are LLVM developer errors rather than "extenuating circumstance" kind of situations.

That said, I have no idea if I'm being too pedantic here. I'm basing this off my own experiences with the code base as well as internal discussions with other folks at Intel working on the project and what their expectations are (both as new-to-the-code-base folks and long-time contributors), but this is a pretty small selection of people.

Generally LLVM's pretty hard to fathom in a non-asserts build anyway, right? (that's the first thing any of us do is reproduce with an assertions build that may fail miles away from where a crash occurred because an invariant was violated much earlier) - that cast won't crash/will continue on happily in a non-asserts build seems like a much larger hole to debuggability of a non-asserts build than any unreachable?

This might be true -- personally, I tend to only use debug builds with MSVC because RelWithDebInfo isn't sufficient for my daily needs. However, I've definitely heard of folks who use RelWithDebInfo for their daily work (RelWithDebInfo + Asserts specifically, IIRC) because of the improved build times and runtime performance; we should be sure we're not disrupting that workflow too much.

Changing assert(0) to llvm_unreachable does not change the behavior of any +Asserts build. The behavior of unreachable is the same as assert(0) in a +Asserts build.

Is this observation enough to undeadlock this conversation?

In D135551#3849816, @dblaikie wrote:

Generally LLVM's pretty hard to fathom in a non-asserts build anyway, right? (that's the first thing any of us do is reproduce with an assertions build that may fail miles away from where a crash occurred because an invariant was violated much earlier) - that cast won't crash/will continue on happily in a non-asserts build seems like a much larger hole to debuggability of a non-asserts build than any unreachable?

This might be true -- personally, I tend to only use debug builds with MSVC because RelWithDebInfo isn't sufficient for my daily needs. However, I've definitely heard of folks who use RelWithDebInfo for their daily work (RelWithDebInfo + Asserts specifically, IIRC) because of the improved build times and runtime performance; we should be sure we're not disrupting that workflow too much.

Changing assert(0) to llvm_unreachable does not change the behavior of any +Asserts build. The behavior of unreachable is the same as assert(0) in a +Asserts build.

Is this observation enough to undeadlock this conversation?

No, it doesn't address the key point I have which is that I want different APIs to express intent instead of using llvm_unreachable in all cases. To me, it is a mistake to label functionally reachable code as being unconditionally unreachable. It requires everyone reading that code to understand our nuances to know that the API signals aspirationally unreachable code rather than functionally unreachable code. These are different scenarios and I don't want to lose that distinction in the places where it matters.

Prefer llvm_report_error() in any circumstance under which a code path is functionally possible to reach, but only in erroneous executions that signify a mistake on the part of the LLVM developer elsewhere in the program.

Prefer llvm_unreachable() in any circumstance under which a code path is believed to be functionally impossible to reach (even if technically possible to reach). The API is now self-documenting to mean "this code really should be totally unreachable".

I think llvm_unreachable already has the functionality reporting bugs for developer in our implementation, with +Assertions by default

In D135551#3849944, @inclyc wrote:

Prefer llvm_report_error() in any circumstance under which a code path is functionally possible to reach, but only in erroneous executions that signify a mistake on the part of the LLVM developer elsewhere in the program.

Prefer llvm_unreachable() in any circumstance under which a code path is believed to be functionally impossible to reach (even if technically possible to reach). The API is now self-documenting to mean "this code really should be totally unreachable".

I think llvm_unreachable already has the functionality reporting bugs for developer in our implementation, with +Assertions by default

Yes, in terms of its runtime behavior. So long as we're not making debugging harder for some large group of people, the runtime behavior is not really what I'm concerned by though. I'm focusing more on code reviewers and project newcomers and whether our code is self-documenting or not. Having a policy to use an API that says code is not reachable in situations where that code is very much reachable is the crux of my problem -- the API is sometimes a lie (and a lie with optimization impacts, at that) and we force everyone to pay the cognitive costs associated with that when reading code.

If we end up with two APIs that have the same runtime behavior, I'm okay with that.

In D135551#3849983, @aaron.ballman wrote:

In D135551#3849944, @inclyc wrote:

Prefer llvm_report_error() in any circumstance under which a code path is functionally possible to reach, but only in erroneous executions that signify a mistake on the part of the LLVM developer elsewhere in the program.

Prefer llvm_unreachable() in any circumstance under which a code path is believed to be functionally impossible to reach (even if technically possible to reach). The API is now self-documenting to mean "this code really should be totally unreachable".

I think llvm_unreachable already has the functionality reporting bugs for developer in our implementation, with +Assertions by default

Yes, in terms of its runtime behavior. So long as we're not making debugging harder for some large group of people, the runtime behavior is not really what I'm concerned by though. I'm focusing more on code reviewers and project newcomers and whether our code is self-documenting or not. Having a policy to use an API that says code is not reachable in situations where that code is very much reachable is the crux of my problem -- the API is sometimes a lie (and a lie with optimization impacts, at that) and we force everyone to pay the cognitive costs associated with that when reading code.

If we end up with two APIs that have the same runtime behavior, I'm okay with that.

Could you elaborate on "aspirationally" vs "functionally" unreachable code here?

In D135551#3850132, @inclyc wrote:

In D135551#3849983, @aaron.ballman wrote:

In D135551#3849944, @inclyc wrote:

Prefer llvm_report_error() in any circumstance under which a code path is functionally possible to reach, but only in erroneous executions that signify a mistake on the part of the LLVM developer elsewhere in the program.

Prefer llvm_unreachable() in any circumstance under which a code path is believed to be functionally impossible to reach (even if technically possible to reach). The API is now self-documenting to mean "this code really should be totally unreachable".

I think llvm_unreachable already has the functionality reporting bugs for developer in our implementation, with +Assertions by default

Yes, in terms of its runtime behavior. So long as we're not making debugging harder for some large group of people, the runtime behavior is not really what I'm concerned by though. I'm focusing more on code reviewers and project newcomers and whether our code is self-documenting or not. Having a policy to use an API that says code is not reachable in situations where that code is very much reachable is the crux of my problem -- the API is sometimes a lie (and a lie with optimization impacts, at that) and we force everyone to pay the cognitive costs associated with that when reading code.

If we end up with two APIs that have the same runtime behavior, I'm okay with that.

Could you elaborate on "aspirationally" vs "functionally" unreachable code here?

Sure!

enum E { Zero, One, Two };

int func(E Val) {
  switch (Val) {
  case Zero: return 12;
  case One: return 200;
  case Two: return -1;
  }
  llvm_unreachable("never get here"); // Functionally unreachable; we can't think of a reasonable way to get here without other alarms going off
}

enum E { Zero, One, Two };

extern "C" int func(unsigned Val) {
  switch (Val) {
  case Zero: return 12;
  case One: return 200;
  case Two: return -1;
  }
  assert(0 && "never get here"); // Aspirationally unreachable; we HOPE we don't get here but it's plausible that we do if someone made a logical mistake calling the API
}

This makes sense! However I think assert(0) should not be used in this case, we could expose another llvm_unreachable-like api and probably llvm_report_error shall be fine. Are there some changed assertions actually "Aspirationally unreachable" in this patch?

In D135551#3850266, @inclyc wrote:

This makes sense! However I think assert(0) should not be used in this case, we could expose another llvm_unreachable-like api and probably llvm_report_error shall be fine. Are there some changed assertions actually "Aspirationally unreachable" in this patch?

No, I really don't think we should go down that path.

I believe these are not actually distinct cases - in either case, the program has UB if they violated the invariants/preconditions - whether or not they called through the C API.

unreachable is no more a guarantee/proven thing than an assertion - both are written by humans and a claim "if this is reached-or-false, there is a bug in some code, somewhere". The statement is not stronger in the unreachable case and the style guide supports that perspective and the way we triage/treat bugs is pretty consistent with that - we get bugs all the time when an unreachable is reached and that doesn't seem to surprise most/anyone - we treat it the same as a bug when an assertion fires.

The discourse discussion, I think, supports this ^ perspective.

As there's still disagreement, should this escalate to the RFC process to change the style guide, Aaron?

In D135551#3850308, @dblaikie wrote:

In D135551#3850266, @inclyc wrote:

This makes sense! However I think assert(0) should not be used in this case, we could expose another llvm_unreachable-like api and probably llvm_report_error shall be fine. Are there some changed assertions actually "Aspirationally unreachable" in this patch?

No, I really don't think we should go down that path.

I believe these are not actually distinct cases - in either case, the program has UB if they violated the invariants/preconditions - whether or not they called through the C API.

The C Index test cases I commented on earlier in the review are a good example of when there's no UB but we still want to alert people to the problem of code they should not be reaching. The assumption that "reached here unexpectedly" == "UB" is invalid. Some things are logic bugs that exhibit no UB.

unreachable is no more a guarantee/proven thing than an assertion - both are written by humans and a claim "if this is reached-or-false, there is a bug in some code, somewhere". The statement is not stronger in the unreachable case and the style guide supports that perspective and the way we triage/treat bugs is pretty consistent with that - we get bugs all the time when an unreachable is reached and that doesn't seem to surprise most/anyone - we treat it the same as a bug when an assertion fires.

The discourse discussion, I think, supports this ^ perspective.

As there's still disagreement, should this escalate to the RFC process to change the style guide, Aaron?

Yes, I would appreciate that. I don't think we're interpreting our policy the same way. Specifically "Use llvm_unreachable to mark a specific point in code that should never be reached." -- "should" is turning out to be interpreted in two ways:

"used to indicate obligation, duty, or correctness, typically when criticizing someone's actions. e.g., he should have been careful": I am asserting it is impossible to reach this.
"used to indicate what is probable. e.g., $348 million should be enough to buy him out": I am asserting you probably won't get here, but you won't be happy if you do.

In D135551#3850266, @inclyc wrote:

This makes sense! However I think assert(0) should not be used in this case, we could expose another llvm_unreachable-like api and probably llvm_report_error shall be fine. Are there some changed assertions actually "Aspirationally unreachable" in this patch?

I'm totally fine not using assert(0) and using an llvm_unreachable-like API (or even using a macro to dispatch to llvm_unreachable under a different name).

There are more aspirationally unreachable issues in this patch, I've commented on the ones I spotted, but I stopped commenting pretty quickly because I think a lot of the cases are made slightly worse by switching to llvm_unreachable instead of more targeted changes. I'd be especially curious to hear what @dblaikie thinks of the suggestions I have though -- it might be easier to see the distinction with real world code (or it might not!).

clang/include/clang/AST/Redeclarable.h
265	This looks like it should probably be: `assert(!PassedFirst && "Passed first decl twice, invalid redecl chain!");` rather than an `if` statement with recovery mechanisms.
clang/include/clang/StaticAnalyzer/Core/PathSensitive/MemRegion.h
601	This looks like it possibly should have been an error reported to the user? If not, this is a wonderful example of what I mean by aspirationally unreachable code. We aspire to not getting here -- but we can get here just the same and there is not UB as a result (we return a valid object, UB might happen elsewhere based on invalid assumptions of what is returned).
clang/lib/AST/ASTImporter.cpp
9976	According to our style guide, this probably should have been `assert(isa<ObjCInterfaceDecl, ObjCProtocolDecl, TagDecl>(D) && "CompleteDecl called on a Decl that can't be completed");` but IMO that's worse than using `assert(0)` because it's less maintainable (any time you add a new else if chain you have to also update the assert). I think `llvm_unreachable` is wrong to use here.
clang/lib/AST/ExprConstant.cpp
7561–7564	Probably should be `assert(Source != E && "OpaqueValueExpr recursively refers to itself");`
clang/lib/AST/Interp/ByteCodeExprGen.cpp
136–137	I think this can just be removed, right? There's another unreachable for falling out of the `switch` that seems to be covering the same situation.
598	The rest of the ones here are somewhat interesting in that the interpreter is an experiment under active development and is known to be incomplete. In all of these cases, I think the switch to unreachable is flat-out wrong -- these asserts serve explicitly to find unimplemented cases when we hit them.
clang/lib/Basic/SourceManager.cpp
62–65	`assert(Buffer != nullptr && "Buffer should never be null");` but that said, this one might be an optimization hint that suggests we should be using `__builtin_assume(Buffer != nullptr)`, I'm not certain.
863–866	`assert(SLocOffset >= CurrentLoadedOffset && "Invalid SLocOffset or bad function choice");`

In D135551#3850391, @aaron.ballman wrote:

In D135551#3850308, @dblaikie wrote:

In D135551#3850266, @inclyc wrote:

This makes sense! However I think assert(0) should not be used in this case, we could expose another llvm_unreachable-like api and probably llvm_report_error shall be fine. Are there some changed assertions actually "Aspirationally unreachable" in this patch?

No, I really don't think we should go down that path.

I believe these are not actually distinct cases - in either case, the program has UB if they violated the invariants/preconditions - whether or not they called through the C API.

The C Index test cases I commented on earlier in the review are a good example of when there's no UB but we still want to alert people to the problem of code they should not be reaching. The assumption that "reached here unexpectedly" == "UB" is invalid. Some things are logic bugs that exhibit no UB.

unreachable is no more a guarantee/proven thing than an assertion - both are written by humans and a claim "if this is reached-or-false, there is a bug in some code, somewhere". The statement is not stronger in the unreachable case and the style guide supports that perspective and the way we triage/treat bugs is pretty consistent with that - we get bugs all the time when an unreachable is reached and that doesn't seem to surprise most/anyone - we treat it the same as a bug when an assertion fires.

The discourse discussion, I think, supports this ^ perspective.

As there's still disagreement, should this escalate to the RFC process to change the style guide, Aaron?

Yes, I would appreciate that. I don't think we're interpreting our policy the same way. Specifically "Use llvm_unreachable to mark a specific point in code that should never be reached."

In the same way that an assert says "This condition should never be false" - I use "should" in the same sense in both unreachable and assert, and I believe that's the prevailing opinion of LLVM developers/the LLVM style guide.

Perhaps we are also at a deadlock as to who should write the proposal... :/

"should" is turning out to be interpreted in two ways:

"used to indicate obligation, duty, or correctness, typically when criticizing someone's actions. e.g., he should have been careful": I am asserting it is impossible to reach this.

This is the "should" ^ I mean, and what every assert should mean too. This code assumes this property to be true - this is a precondition of the code.

We should not be using asserts where we don't mean this. I'm OK assuming every assert does mean "this is a precondition" and treating them that way in terms of transforming them to unreachable or anything else we might do with them - and if some of them don't mean it, then they're buggy and we can fix them, but assert->unreachable doesn't make the situation any worse.

Any code behind/after an assert is untested and unvalidated - we can't say "if you violate this constraint it'll actually be OK" because we've never tested that/don't know that.

"used to indicate what is probable. e.g., $348 million should be enough to buy him out": I am asserting you probably won't get here, but you won't be happy if you do.

In D135551#3850266, @inclyc wrote:

This makes sense! However I think assert(0) should not be used in this case, we could expose another llvm_unreachable-like api and probably llvm_report_error shall be fine. Are there some changed assertions actually "Aspirationally unreachable" in this patch?

I'm totally fine not using assert(0) and using an llvm_unreachable-like API (or even using a macro to dispatch to llvm_unreachable under a different name).

There are more aspirationally unreachable issues in this patch, I've commented on the ones I spotted, but I stopped commenting pretty quickly because I think a lot of the cases are made slightly worse by switching to llvm_unreachable instead of more targeted changes. I'd be especially curious to hear what @dblaikie thinks of the suggestions I have though -- it might be easier to see the distinction with real world code (or it might not!).

Maybe - happy to talk about a few of the examples, but I'm not feeling super optimistic that we'll come to an understanding here, unfortunately :/

@rnk's comment here ( https://discourse.llvm.org/t/llvm-unreachable-is-widely-misused/60587/3 ) pretty well sums up my understanding/values here and it looks like on that thread, mostly the long term LLVM developers agree with this perspective and are trying to explain it to the (so far as I can tell) relative new/outside developer.

clang/include/clang/AST/Redeclarable.h
265	Yep, I've commented on similar things in the past - not sure we ever got it into the style guide, but "branch to unreachable" should be avoided in favor of assert-the-branch-condition (assuming it has no side effects, or if it does have side effects - do the thing, store the result in a local variable, void cast it to suppress unused-variable warnings and assert it) but I still think this is a valid transformation - it's just not the whole transformation, so I'm OK with things like this being done mechanically and then cleaned up further (possibly also mechanically) later.
clang/include/clang/StaticAnalyzer/Core/PathSensitive/MemRegion.h
601	"we" can get here in what sense? Is there source code that can be passed to the static analyzer that can reach this? Then it shouldn't be an assert, it should be exercised and tested - even if that test says "hey, this doesn't do anything good yet". I'd assume, whether I see the assert or the unreachable that it isn't reachable from clang on the command line - and that any code that makes it reachable/calls it under this condition is incorrect (for now, until support is added) and would be considered a bug to be fixed. Whether it's the assert or the unreachable, I consider it to be the same statement of intent.
clang/lib/AST/ASTImporter.cpp
9976	Yep, this is a case where branch-to-unreachable might be nicer than contorting the situation into `assert(isa...`) - actually, probably not even that. I'd likely change this code to unconditionally cast, relying on the cast's assertion. Why is llvm_unreachable wrong here? If you end up with a non-TagDecl here, you're certainly in UB territory - sooner or later, you're going to treat the non-TagDecl as a TagDecl and it's not going to be pretty.
clang/lib/AST/ExprConstant.cpp
7561–7564	Yep, though, again, I think it's a valid transformation even if it doesn't go as far as it could.
clang/lib/AST/Interp/ByteCodeExprGen.cpp
136–137	Presumably the switch isn't fully covered (otherwise we'd get a `-Wcovered-switch-default` warning). Might be that the unreachable after the switch can be removed in this case.
598	& I don't see why unreachable is any different a statement than assert(false) in these cases... - it's the same statement of intent. "if this is reached you've found a bug" (in this case, a missing feature) But I'd be sort of OK changing all these to report_fatal_error. But, again, I think the assert(false) -> unreachable is a valid transformation and doesn't make anything worse than it already is, but improves the code by being more consistent and removing this confusion that there might be something different about assert(false) when, I believe, there isn't.

In D135551#3850511, @dblaikie wrote:

In D135551#3850391, @aaron.ballman wrote:

In D135551#3850308, @dblaikie wrote:

In D135551#3850266, @inclyc wrote:

This makes sense! However I think assert(0) should not be used in this case, we could expose another llvm_unreachable-like api and probably llvm_report_error shall be fine. Are there some changed assertions actually "Aspirationally unreachable" in this patch?

No, I really don't think we should go down that path.

I believe these are not actually distinct cases - in either case, the program has UB if they violated the invariants/preconditions - whether or not they called through the C API.

The C Index test cases I commented on earlier in the review are a good example of when there's no UB but we still want to alert people to the problem of code they should not be reaching. The assumption that "reached here unexpectedly" == "UB" is invalid. Some things are logic bugs that exhibit no UB.

unreachable is no more a guarantee/proven thing than an assertion - both are written by humans and a claim "if this is reached-or-false, there is a bug in some code, somewhere". The statement is not stronger in the unreachable case and the style guide supports that perspective and the way we triage/treat bugs is pretty consistent with that - we get bugs all the time when an unreachable is reached and that doesn't seem to surprise most/anyone - we treat it the same as a bug when an assertion fires.

The discourse discussion, I think, supports this ^ perspective.

As there's still disagreement, should this escalate to the RFC process to change the style guide, Aaron?

Yes, I would appreciate that. I don't think we're interpreting our policy the same way. Specifically "Use llvm_unreachable to mark a specific point in code that should never be reached."

In the same way that an assert says "This condition should never be false" - I use "should" in the same sense in both unreachable and assert, and I believe that's the prevailing opinion of LLVM developers/the LLVM style guide.

I believe our code base says something different than our "prevailing opinion" and we should not be discounting the reality of how our code is written today.

Perhaps we are also at a deadlock as to who should write the proposal... :/

Agreed, you and I are probably both too close to the issue to write the proposal right now. If nobody else does it first, maybe the two of us could circle back in a few months after we've had time to research and think more deeply and we could co-write something (even if it is a multiple choice RFC between conflicting directions).

"should" is turning out to be interpreted in two ways:

"used to indicate obligation, duty, or correctness, typically when criticizing someone's actions. e.g., he should have been careful": I am asserting it is impossible to reach this.

This is the "should" ^ I mean, and what every assert should mean too. This code assumes this property to be true - this is a precondition of the code.

We should not be using asserts where we don't mean this. I'm OK assuming every assert does mean "this is a precondition" and treating them that way in terms of transforming them to unreachable or anything else we might do with them - and if some of them don't mean it, then they're buggy and we can fix them, but assert->unreachable doesn't make the situation any worse.

Any code behind/after an assert is untested and unvalidated - we can't say "if you violate this constraint it'll actually be OK" because we've never tested that/don't know that.

Just to double-check... are you opposed to the idea of differentiating between ways of saying "the reachability of this code is in question" or are you opposed to use of assert (or something that smells too similar) specifically? Because I don't care about *how* we spell the differentiation, just that there's not a policy limiting my ability to express my intent in code. I'd be perfectly fine with #define some_name_we_agree_upon(msg) llvm_unreachable(msg) being the facility we use instead of assert(0);.

"used to indicate what is probable. e.g., $348 million should be enough to buy him out": I am asserting you probably won't get here, but you won't be happy if you do.

I'm arguing that we have a not-insignificant amount of uses that have this^ interpretation and I want a way to express that distinction in code. I have an allergic reaction to using llvm_unreachable to express known-to-be-potentially reachable code because my complaint is that the annotation causes the code to be *less readable* as a result. I have to know about our community's novel definition of what "unreachable" means in order to read that code correctly. I think we'd be better served to leave "unreachable" annotations for places that we know we can't reach and use literally any other named API to express situations we know can potentially be reached (the assumption that we're going to have test coverage for all of these situations is a non-starter to me; until the community starts showing a stronger interest in tracking test coverage metrics and holding ourselves accountable for that coverage, it's not a compelling argument that "tests should catch this" because we know they won't).

@rnk's comment here ( https://discourse.llvm.org/t/llvm-unreachable-is-widely-misused/60587/3 ) pretty well sums up my understanding/values here and it looks like on that thread, mostly the long term LLVM developers agree with this perspective and are trying to explain it to the (so far as I can tell) relative new/outside developer.

I don't want to belabor this topic any longer as I think we've both said our pieces enough by now. I don't think we should make any code changes at this point unless there's agreement that the code is actually improved by the change. I think we can identify some of those noncontroversial cases in this review so that we get some benefit from all this effort. But I think we should leave anything that's contentious alone for the time being.

clang/include/clang/StaticAnalyzer/Core/PathSensitive/MemRegion.h
601	"we" can get here in what sense? Is there source code that can be passed to the static analyzer that can reach this? You are asking the question I claim the original source already answered and the new source fails to answer. Use of `assert(false)` like that tells me as a code reviewer "this code could potentially be reached and if it does, that's a problem" and so I know to go look for those cases to make sure we handle them properly, or if I've found a bug after the code was released I then have a better idea of what possible problems exist. Asserting "this is unreachable" tells me "the developer already did that audit and knows this is unreachable" and sends me down other, less productive paths before I finally go "oh, that annotation was lying to me, this isn't actually unreachable." Then it shouldn't be an assert, it should be exercised and tested - even if that test says "hey, this doesn't do anything good yet". I like that idea. In reality, we're nowhere near having test coverage like that (at least in Clang).
clang/lib/AST/ExprConstant.cpp
7561–7564	One thing that's worth noting -- the proposed code and my "probably" observation from above have the result of making the code less robust because they remove the perfectly reasonable recovery mechanism of saying there's an error. So they both go from "may give odd results but is unlikely to cause a vulnerability" to "vulnerability more possible" by losing that property.
clang/lib/AST/Interp/ByteCodeExprGen.cpp
136–137	Fair -- so long as we only end up with one unreachable, I think that's an improvement.
598	& I don't see why unreachable is any different a statement than assert(false) in these cases... - it's the same statement of intent. "if this is reached you've found a bug" (in this case, a missing feature) You are asserting it's the same statement of intent and I keep pointing out that people use the different constructs in practice because they're different statements of intent. I don't know how to resolve this difference of opinion, but I can say as someone doing code review in this area recently that your interpretation is wrong according to what we were after with this code. I'd be fine changing it to `report_fatal_error` instead of `assert(false)`; I'd be strongly opposed to switching to `llvm_unreachable`.

I think the disagreement here highlights the need to have a serious discussion about the future of error handling across the LLVM project. As you say, it sounds like you're not going to reach agreement on this code review, so maybe the best short term next step is to land the uncontroversial changes that Aaron agrees with.

Regarding error handling and the wide usage of assertions to guard UB across LLVM, we need to decide what our goals are as a community. Is it actually the goal of the project that Clang and LLVM that no input can lead to UB? If we can't guarantee that, is there some error budget we consider acceptable (fuzzer runs for 24hrs and can't find bugs)? How does that goal rate against our other goals, like performance? We could just ship with assertions enabled, sacrifice 20% code size and performance, and call it a day. We used to do that for Chromium, but users complained that compiles were too slow and we stopped doing it.

I think the status quo has real problems. We pretend that we can do both of these:

Assert liberally, with the understanding that assertion failures lead to UB (failed bad cast check, bounds checks, unreachable code, etc)
We can actually find and fix all cases that violate those inputs to the point that clang is stable and secure enough for our satisfaction

Currently, it is really easy to run fuzzers and find crash bugs in clang. I think the lesson we should take from that is that we are compromising goal 2 here, and we shouldn't kid ourselves about it.

Maybe the goal is not security, but is instead something about user or developer experience, but we should go through some higher level process to clarify that goal so we can write it down and agree on it.

clang/lib/Analysis/CFG.cpp
1047	This will create unreachable code warnings, which must be addressed before landing.

arsenm added a subscriber: arsenm.Oct 12 2022, 12:17 PM

arsenm added inline comments.

clang/lib/AST/Interp/ByteCodeExprGen.cpp
598	I use llvm_unreachable as a nicer to use assert in if/else chains like this. I also see no difference in the intent between assert and unreachable; assert(0 && "message") is just uglier. report_fatal_error is for something a user could plausibly run into but also isn't worth wiring into a proper error diagnostic (which happens a lot in codegen)

In D135551#3853365, @rnk wrote:

I think the status quo has real problems. We pretend that we can do both of these:

Assert liberally, with the understanding that assertion failures lead to UB (failed bad cast check, bounds checks, unreachable code, etc)

We can actually find and fix all cases that violate those inputs to the point that clang is stable and secure enough for our satisfaction

Currently, it is really easy to run fuzzers and find crash bugs in clang. I think the lesson we should take from that is that we are compromising goal 2 here, and we shouldn't kid ourselves about it.

Maybe the goal is not security, but is instead something about user or developer experience, but we should go through some higher level process to clarify that goal so we can write it down and agree on it.

+1 to all of this

dblaikie:

In the same way that an assert says "This condition should never be false" - I use "should" in the same sense in both unreachable and assert, and I believe that's the prevailing opinion of LLVM developers/the LLVM style guide.

aaronballman:

I believe our code base says something different than our "prevailing opinion" and we should not be discounting the reality of how our code is written today.

So, there are lots of ways and places where "the reality of how our code is written today" fails to conform to the style guide.
One conclusion is that the style guide is wrong and should be changed to reflect reality.
Another conclusion is that conforming to the style guide is the goal and one aspect of development is to strive to reach the goal. The imperfection of the state of the code today merely reflects an imperfect understanding of the goal.

I suggest a Round Table at the (less than one month away) Dev Meeting, assuming that the active parties in this discussion will be there. Put off any RFC until after that.

Revision Contents

Path

Size

clang/

include/

clang/

AST/

Redeclarable.h

2 lines

StaticAnalyzer/

Core/

PathSensitive/

MemRegion.h

2 lines

lib/

AST/

ASTImporter.cpp

2 lines

ExprConstant.cpp

2 lines

Interp/

ByteCodeExprGen.cpp

12 lines

Analysis/

CFG.cpp

2 lines

Basic/

SourceManager.cpp

8 lines

Targets/

NVPTX.cpp

2 lines

CodeGen/

CGHLSLRuntime.cpp

4 lines

CodeGenModule.cpp

3 lines

Driver/

Multilib.cpp

2 lines

ToolChains/

Flang.cpp

4 lines

Format/

Format.cpp

2 lines

Frontend/

Rewrite/

RewriteModernObjC.cpp

4 lines

RewriteObjC.cpp

2 lines

SARIFDiagnostic.cpp

10 lines

Lex/

PPDirectives.cpp

2 lines

PreprocessingRecord.cpp

4 lines

Parse/

ParseDecl.cpp

2 lines

Sema/

SemaChecking.cpp

2 lines

SemaCodeComplete.cpp

2 lines

Serialization/

ASTReader.cpp

4 lines

ASTWriter.cpp

8 lines

StaticAnalyzer/

Core/

CoreEngine.cpp

2 lines

SVals.cpp

2 lines

tools/

clang-refactor/

TestSupport.cpp

4 lines

libclang/

CIndex.cpp

6 lines

CXCursor.cpp

8 lines

utils/

TableGen/

ClangSyntaxEmitter.cpp

2 lines

Diff 466471

clang/include/clang/AST/Redeclarable.h

Show First 20 Lines • Show All 256 Lines • ▼ Show 20 Lines	public:
pointer operator->() const { return Current; }		pointer operator->() const { return Current; }

redecl_iterator& operator++() {		redecl_iterator& operator++() {
assert(Current && "Advancing while iterator has reached end");		assert(Current && "Advancing while iterator has reached end");
// Make sure we don't infinitely loop on an invalid redecl chain. This		// Make sure we don't infinitely loop on an invalid redecl chain. This
// should never happen.		// should never happen.
if (Current->isFirstDecl()) {		if (Current->isFirstDecl()) {
if (PassedFirst) {		if (PassedFirst) {
assert(0 && "Passed first decl twice, invalid redecl chain!");		llvm_unreachable("Passed first decl twice, invalid redecl chain!");
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions This looks like it should probably be: `assert(!PassedFirst && "Passed first decl twice, invalid redecl chain!");` rather than an `if` statement with recovery mechanisms. aaron.ballman: This looks like it should probably be: `assert(!PassedFirst && "Passed first decl twice…
		dblaikieUnsubmitted Not Done Reply Inline Actions Yep, I've commented on similar things in the past - not sure we ever got it into the style guide, but "branch to unreachable" should be avoided in favor of assert-the-branch-condition (assuming it has no side effects, or if it does have side effects - do the thing, store the result in a local variable, void cast it to suppress unused-variable warnings and assert it) but I still think this is a valid transformation - it's just not the whole transformation, so I'm OK with things like this being done mechanically and then cleaned up further (possibly also mechanically) later. dblaikie: Yep, I've commented on similar things in the past - not sure we ever got it into the style…
Current = nullptr;		Current = nullptr;
return *this;		return *this;
}		}
PassedFirst = true;		PassedFirst = true;
}		}

// Get either previous decl or latest decl.		// Get either previous decl or latest decl.
decl_type *Next = Current->getNextRedeclaration();		decl_type *Next = Current->getNextRedeclaration();
▲ Show 20 Lines • Show All 156 Lines • Show Last 20 Lines

clang/include/clang/StaticAnalyzer/Core/PathSensitive/MemRegion.h

	Show First 20 Lines • Show All 592 Lines • ▼ Show 20 Lines
	public:			public:
	QualType getLocationType() const override {			QualType getLocationType() const override {
	const ASTContext &Ctx = getContext();			const ASTContext &Ctx = getContext();
	if (const auto *D = dyn_cast<FunctionDecl>(FD)) {			if (const auto *D = dyn_cast<FunctionDecl>(FD)) {
	return Ctx.getPointerType(D->getType());			return Ctx.getPointerType(D->getType());
	}			}

	assert(isa<ObjCMethodDecl>(FD));			assert(isa<ObjCMethodDecl>(FD));
	assert(false && "Getting the type of ObjCMethod is not supported yet");			llvm_unreachable("Getting the type of ObjCMethod is not supported yet");
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions This looks like it possibly should have been an error reported to the user? If not, this is a wonderful example of what I mean by aspirationally unreachable code. We aspire to not getting here -- but we can get here just the same and there is not UB as a result (we return a valid object, UB might happen elsewhere based on invalid assumptions of what is returned). aaron.ballman: This looks like it possibly should have been an error reported to the user? If not, this is a…
				dblaikieUnsubmitted Not Done Reply Inline Actions "we" can get here in what sense? Is there source code that can be passed to the static analyzer that can reach this? Then it shouldn't be an assert, it should be exercised and tested - even if that test says "hey, this doesn't do anything good yet". I'd assume, whether I see the assert or the unreachable that it isn't reachable from clang on the command line - and that any code that makes it reachable/calls it under this condition is incorrect (for now, until support is added) and would be considered a bug to be fixed. Whether it's the assert or the unreachable, I consider it to be the same statement of intent. dblaikie: "we" can get here in what sense? Is there source code that can be passed to the static analyzer…
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions "we" can get here in what sense? Is there source code that can be passed to the static analyzer that can reach this? You are asking the question I claim the original source already answered and the new source fails to answer. Use of `assert(false)` like that tells me as a code reviewer "this code could potentially be reached and if it does, that's a problem" and so I know to go look for those cases to make sure we handle them properly, or if I've found a bug after the code was released I then have a better idea of what possible problems exist. Asserting "this is unreachable" tells me "the developer already did that audit and knows this is unreachable" and sends me down other, less productive paths before I finally go "oh, that annotation was lying to me, this isn't actually unreachable." Then it shouldn't be an assert, it should be exercised and tested - even if that test says "hey, this doesn't do anything good yet". I like that idea. In reality, we're nowhere near having test coverage like that (at least in Clang). aaron.ballman: > "we" can get here in what sense? Is there source code that can be passed to the static…

	// TODO: We might want to return a different type here (ex: id (*ty)(...))			// TODO: We might want to return a different type here (ex: id (*ty)(...))
	// depending on how it is used.			// depending on how it is used.
	return {};			return {};
	}			}

	const NamedDecl *getDecl() const {			const NamedDecl *getDecl() const {
	return FD;			return FD;
	▲ Show 20 Lines • Show All 1,010 Lines • Show Last 20 Lines

clang/lib/AST/ASTImporter.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 9,967 Lines • ▼ Show 20 Lines	void ASTImporter::CompleteDecl (Decl *D) {
}		}
else if (auto *TD = dyn_cast<TagDecl>(D)) {		else if (auto *TD = dyn_cast<TagDecl>(D)) {
if (!TD->getDefinition() && !TD->isBeingDefined()) {		if (!TD->getDefinition() && !TD->isBeingDefined()) {
TD->startDefinition();		TD->startDefinition();
TD->setCompleteDefinition(true);		TD->setCompleteDefinition(true);
}		}
}		}
else {		else {
assert(0 && "CompleteDecl called on a Decl that can't be completed");		llvm_unreachable("CompleteDecl called on a Decl that can't be completed");
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions According to our style guide, this probably should have been `assert(isa<ObjCInterfaceDecl, ObjCProtocolDecl, TagDecl>(D) && "CompleteDecl called on a Decl that can't be completed");` but IMO that's worse than using `assert(0)` because it's less maintainable (any time you add a new else if chain you have to also update the assert). I think `llvm_unreachable` is wrong to use here. aaron.ballman: According to our style guide, this probably should have been `assert(isa<ObjCInterfaceDecl…
		dblaikieUnsubmitted Not Done Reply Inline Actions Yep, this is a case where branch-to-unreachable might be nicer than contorting the situation into `assert(isa...`) - actually, probably not even that. I'd likely change this code to unconditionally cast, relying on the cast's assertion. Why is llvm_unreachable wrong here? If you end up with a non-TagDecl here, you're certainly in UB territory - sooner or later, you're going to treat the non-TagDecl as a TagDecl and it's not going to be pretty. dblaikie: Yep, this is a case where branch-to-unreachable might be nicer than contorting the situation…
}		}
}		}

Decl ASTImporter::MapImported(Decl From, Decl *To) {		Decl ASTImporter::MapImported(Decl From, Decl *To) {
llvm::DenseMap<Decl , Decl >::iterator Pos = ImportedDecls.find(From);		llvm::DenseMap<Decl , Decl >::iterator Pos = ImportedDecls.find(From);
assert((Pos == ImportedDecls.end() \|\| Pos->second == To) &&		assert((Pos == ImportedDecls.end() \|\| Pos->second == To) &&
"Try to import an already imported Decl");		"Try to import an already imported Decl");
if (Pos != ImportedDecls.end())		if (Pos != ImportedDecls.end())
▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

clang/lib/AST/ExprConstant.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,552 Lines • ▼ Show 20 Lines	public:

bool VisitOpaqueValueExpr(const OpaqueValueExpr *E) {		bool VisitOpaqueValueExpr(const OpaqueValueExpr *E) {
if (APValue *Value = Info.CurrentCall->getCurrentTemporary(E))		if (APValue *Value = Info.CurrentCall->getCurrentTemporary(E))
return DerivedSuccess(*Value, E);		return DerivedSuccess(*Value, E);

const Expr *Source = E->getSourceExpr();		const Expr *Source = E->getSourceExpr();
if (!Source)		if (!Source)
return Error(E);		return Error(E);
if (Source == E) {		if (Source == E) {
assert(0 && "OpaqueValueExpr recursively refers to itself");		llvm_unreachable("OpaqueValueExpr recursively refers to itself");
return Error(E);		return Error(E);
}		}
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Probably should be `assert(Source != E && "OpaqueValueExpr recursively refers to itself");` aaron.ballman: Probably should be `assert(Source != E && "OpaqueValueExpr recursively refers to itself");`
		dblaikieUnsubmitted Not Done Reply Inline Actions Yep, though, again, I think it's a valid transformation even if it doesn't go as far as it could. dblaikie: Yep, though, again, I think it's a valid transformation even if it doesn't go as far as it…
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions One thing that's worth noting -- the proposed code and my "probably" observation from above have the result of making the code less robust because they remove the perfectly reasonable recovery mechanism of saying there's an error. So they both go from "may give odd results but is unlikely to cause a vulnerability" to "vulnerability more possible" by losing that property. aaron.ballman: One thing that's worth noting -- the proposed code and my "probably" observation from above…
return StmtVisitorTy::Visit(Source);		return StmtVisitorTy::Visit(Source);
}		}

bool VisitPseudoObjectExpr(const PseudoObjectExpr *E) {		bool VisitPseudoObjectExpr(const PseudoObjectExpr *E) {
for (const Expr *SemE : E->semantics()) {		for (const Expr *SemE : E->semantics()) {
if (auto *OVE = dyn_cast<OpaqueValueExpr>(SemE)) {		if (auto *OVE = dyn_cast<OpaqueValueExpr>(SemE)) {
// FIXME: We can't handle the case where an OpaqueValueExpr is also the		// FIXME: We can't handle the case where an OpaqueValueExpr is also the
// result expression: there could be two different LValues that would		// result expression: there could be two different LValues that would
▲ Show 20 Lines • Show All 8,556 Lines • Show Last 20 Lines

clang/lib/AST/Interp/ByteCodeExprGen.cpp

Show First 20 Lines • Show All 127 Lines • ▼ Show 20 Lines	if (!this->Visit(SubExpr))
return false;		return false;

return this->emitCast(FromT, ToT, CE);		return this->emitCast(FromT, ToT, CE);
}		}

case CK_ToVoid:		case CK_ToVoid:
return discard(SubExpr);		return discard(SubExpr);

default:		default:
assert(false && "Cast not implemented");		llvm_unreachable("Cast not implemented");
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions I think this can just be removed, right? There's another unreachable for falling out of the `switch` that seems to be covering the same situation. aaron.ballman: I think this can just be removed, right? There's another unreachable for falling out of the…
		dblaikieUnsubmitted Not Done Reply Inline Actions Presumably the switch isn't fully covered (otherwise we'd get a `-Wcovered-switch-default` warning). Might be that the unreachable after the switch can be removed in this case. dblaikie: Presumably the switch isn't fully covered (otherwise we'd get a `-Wcovered-switch-default`…
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Fair -- so long as we only end up with one unreachable, I think that's an improvement. aaron.ballman: Fair -- so long as we only end up with one unreachable, I think that's an improvement.
}		}
llvm_unreachable("Unhandled clang::CastKind enum");		llvm_unreachable("Unhandled clang::CastKind enum");
}		}

template <class Emitter>		template <class Emitter>
bool ByteCodeExprGen<Emitter>::VisitIntegerLiteral(const IntegerLiteral *LE) {		bool ByteCodeExprGen<Emitter>::VisitIntegerLiteral(const IntegerLiteral *LE) {
if (DiscardResult)		if (DiscardResult)
return true;		return true;
▲ Show 20 Lines • Show All 444 Lines • ▼ Show 20 Lines	for (const Expr *Init : InitList->inits()) {
return false;		return false;
} else if (Optional<PrimType> T = classify(InitType)) {		} else if (Optional<PrimType> T = classify(InitType)) {
// Visit the primitive element like normal.		// Visit the primitive element like normal.
if (!this->visit(Init))		if (!this->visit(Init))
return false;		return false;
if (!this->emitInitElem(*T, ElementIndex, Init))		if (!this->emitInitElem(*T, ElementIndex, Init))
return false;		return false;
} else {		} else {
assert(false && "Unhandled type in array initializer initlist");		llvm_unreachable("Unhandled type in array initializer initlist");
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions The rest of the ones here are somewhat interesting in that the interpreter is an experiment under active development and is known to be incomplete. In all of these cases, I think the switch to unreachable is flat-out wrong -- these asserts serve explicitly to find unimplemented cases when we hit them. aaron.ballman: The rest of the ones here are somewhat interesting in that the interpreter is an experiment…
		dblaikieUnsubmitted Not Done Reply Inline Actions & I don't see why unreachable is any different a statement than assert(false) in these cases... - it's the same statement of intent. "if this is reached you've found a bug" (in this case, a missing feature) But I'd be sort of OK changing all these to report_fatal_error. But, again, I think the assert(false) -> unreachable is a valid transformation and doesn't make anything worse than it already is, but improves the code by being more consistent and removing this confusion that there might be something different about assert(false) when, I believe, there isn't. dblaikie: & I don't see why unreachable is any different a statement than assert(false) in these cases...
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions & I don't see why unreachable is any different a statement than assert(false) in these cases... - it's the same statement of intent. "if this is reached you've found a bug" (in this case, a missing feature) You are asserting it's the same statement of intent and I keep pointing out that people use the different constructs in practice because they're different statements of intent. I don't know how to resolve this difference of opinion, but I can say as someone doing code review in this area recently that your interpretation is wrong according to what we were after with this code. I'd be fine changing it to `report_fatal_error` instead of `assert(false)`; I'd be strongly opposed to switching to `llvm_unreachable`. aaron.ballman: > & I don't see why unreachable is any different a statement than assert(false) in these cases..
		arsenmUnsubmitted Not Done Reply Inline Actions I use llvm_unreachable as a nicer to use assert in if/else chains like this. I also see no difference in the intent between assert and unreachable; assert(0 && "message") is just uglier. report_fatal_error is for something a user could plausibly run into but also isn't worth wiring into a proper error diagnostic (which happens a lot in codegen) arsenm: I use llvm_unreachable as a nicer to use assert in if/else chains like this. I also see no…
}		}

++ElementIndex;		++ElementIndex;
}		}

} else {		} else {
assert(false && "Unknown expression for array initialization");		llvm_unreachable("Unknown expression for array initialization");
}		}

return true;		return true;
}		}

template <class Emitter>		template <class Emitter>
bool ByteCodeExprGen<Emitter>::visitInitializer(const Expr *Initializer) {		bool ByteCodeExprGen<Emitter>::visitInitializer(const Expr *Initializer) {
QualType InitializerType = Initializer->getType();		QualType InitializerType = Initializer->getType();
▲ Show 20 Lines • Show All 135 Lines • ▼ Show 20 Lines	if (T \|\| ReturnType->isVoidType()) {
if (!this->visit(Arg))		if (!this->visit(Arg))
return false;		return false;
}		}

if (T)		if (T)
return this->emitCall(*T, Func, E);		return this->emitCall(*T, Func, E);
return this->emitCallVoid(Func, E);		return this->emitCallVoid(Func, E);
} else {		} else {
assert(false && "Can't classify function return type");		llvm_unreachable("Can't classify function return type");
}		}

} else {		} else {
assert(false && "We don't support non-FunctionDecl callees right now.");		llvm_unreachable("We don't support non-FunctionDecl callees right now.");
}		}

return false;		return false;
}		}

template <class Emitter>		template <class Emitter>
bool ByteCodeExprGen<Emitter>::VisitCXXDefaultArgExpr(		bool ByteCodeExprGen<Emitter>::VisitCXXDefaultArgExpr(
const CXXDefaultArgExpr *E) {		const CXXDefaultArgExpr *E) {
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	return dereference(
[this, E](PrimType T) {		[this, E](PrimType T) {
return DiscardResult ? this->emitPop(T, E) : true;		return DiscardResult ? this->emitPop(T, E) : true;
});		});
case UO_Not: // ~x		case UO_Not: // ~x
case UO_Real: // __real x		case UO_Real: // __real x
case UO_Imag: // __imag x		case UO_Imag: // __imag x
case UO_Extension:		case UO_Extension:
case UO_Coawait:		case UO_Coawait:
assert(false && "Unhandled opcode");		llvm_unreachable("Unhandled opcode");
}		}

return false;		return false;
}		}

template <class Emitter>		template <class Emitter>
bool ByteCodeExprGen<Emitter>::VisitDeclRefExpr(const DeclRefExpr *E) {		bool ByteCodeExprGen<Emitter>::VisitDeclRefExpr(const DeclRefExpr *E) {
const auto *Decl = E->getDecl();		const auto *Decl = E->getDecl();
▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

clang/lib/Analysis/CFG.cpp

Show First 20 Lines • Show All 1,037 Lines • ▼ Show 20 Lines	if (const auto *UnOp = dyn_cast<UnaryOperator>(E->IgnoreParens())) {
return Value;		return Value;
case UO_Minus:		case UO_Minus:
return -Value;		return -Value;
case UO_Not:		case UO_Not:
return ~Value;		return ~Value;
case UO_LNot:		case UO_LNot:
return llvm::APInt(Context->getTypeSize(Context->IntTy), !Value);		return llvm::APInt(Context->getTypeSize(Context->IntTy), !Value);
default:		default:
assert(false && "Unexpected unary operator!");		llvm_unreachable("Unexpected unary operator!");
return llvm::None;		return llvm::None;
		rnkUnsubmitted Not Done Reply Inline Actions This will create unreachable code warnings, which must be addressed before landing. rnk: This will create unreachable code warnings, which must be addressed before landing.
}		}
}		}
} else if (const auto *IntLiteral =		} else if (const auto *IntLiteral =
dyn_cast<IntegerLiteral>(E->IgnoreParens()))		dyn_cast<IntegerLiteral>(E->IgnoreParens()))
return IntLiteral->getValue();		return IntLiteral->getValue();

return llvm::None;		return llvm::None;
}		}
▲ Show 20 Lines • Show All 5,300 Lines • Show Last 20 Lines

clang/lib/Basic/SourceManager.cpp

Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
/// ContentCache. This can be 0 if the MemBuffer was not actually expanded.		/// ContentCache. This can be 0 if the MemBuffer was not actually expanded.
unsigned ContentCache::getSizeBytesMapped() const {		unsigned ContentCache::getSizeBytesMapped() const {
return Buffer ? Buffer->getBufferSize() : 0;		return Buffer ? Buffer->getBufferSize() : 0;
}		}

/// Returns the kind of memory used to back the memory buffer for		/// Returns the kind of memory used to back the memory buffer for
/// this content cache. This is used for performance analysis.		/// this content cache. This is used for performance analysis.
llvm::MemoryBuffer::BufferKind ContentCache::getMemoryBufferKind() const {		llvm::MemoryBuffer::BufferKind ContentCache::getMemoryBufferKind() const {
if (Buffer == nullptr) {		if (Buffer == nullptr) {
assert(0 && "Buffer should never be null");		llvm_unreachable("Buffer should never be null");
return llvm::MemoryBuffer::MemoryBuffer_Malloc;		return llvm::MemoryBuffer::MemoryBuffer_Malloc;
}		}
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions `assert(Buffer != nullptr && "Buffer should never be null");` but that said, this one might be an optimization hint that suggests we should be using `__builtin_assume(Buffer != nullptr)`, I'm not certain. aaron.ballman: `assert(Buffer != nullptr && "Buffer should never be null");` but that said, this one might be…
return Buffer->getBufferKind();		return Buffer->getBufferKind();
}		}

/// getSize - Returns the size of the content encapsulated by this ContentCache.		/// getSize - Returns the size of the content encapsulated by this ContentCache.
/// This can be the size of the source file or the size of an arbitrary		/// This can be the size of the source file or the size of an arbitrary
/// scratch buffer. If the ContentCache encapsulates a source file, that		/// scratch buffer. If the ContentCache encapsulates a source file, that
/// file is not lazily brought in from disk to satisfy this query.		/// file is not lazily brought in from disk to satisfy this query.
unsigned ContentCache::getSize() const {		unsigned ContentCache::getSize() const {
▲ Show 20 Lines • Show All 781 Lines • ▼ Show 20 Lines	FileID SourceManager::getFileIDLocal(SourceLocation::UIntTy SLocOffset) const {
}		}
}		}

/// Return the FileID for a SourceLocation with a high offset.		/// Return the FileID for a SourceLocation with a high offset.
///		///
/// This function knows that the SourceLocation is in a loaded buffer, not a		/// This function knows that the SourceLocation is in a loaded buffer, not a
/// local one.		/// local one.
FileID SourceManager::getFileIDLoaded(SourceLocation::UIntTy SLocOffset) const {		FileID SourceManager::getFileIDLoaded(SourceLocation::UIntTy SLocOffset) const {
if (SLocOffset < CurrentLoadedOffset) {		if (SLocOffset < CurrentLoadedOffset) {
assert(0 && "Invalid SLocOffset or bad function choice");		llvm_unreachable("Invalid SLocOffset or bad function choice");
return FileID();		return FileID();
}		}
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions `assert(SLocOffset >= CurrentLoadedOffset && "Invalid SLocOffset or bad function choice");` aaron.ballman: `assert(SLocOffset >= CurrentLoadedOffset && "Invalid SLocOffset or bad function choice");`

// Essentially the same as the local case, but the loaded array is sorted		// Essentially the same as the local case, but the loaded array is sorted
// in the other direction (decreasing order).		// in the other direction (decreasing order).
// GreaterIndex is the one where the offset is greater, which is actually a		// GreaterIndex is the one where the offset is greater, which is actually a
// lower index!		// lower index!
unsigned GreaterIndex = 0;		unsigned GreaterIndex = 0;
unsigned LessIndex = LoadedSLocEntryTable.size();		unsigned LessIndex = LoadedSLocEntryTable.size();
if (LastFileIDLookup.ID < 0) {		if (LastFileIDLookup.ID < 0) {
Show All 28 Lines	while (true) {
++NumProbes;		++NumProbes;
unsigned MiddleIndex = (LessIndex - GreaterIndex) / 2 + GreaterIndex;		unsigned MiddleIndex = (LessIndex - GreaterIndex) / 2 + GreaterIndex;
const SrcMgr::SLocEntry &E = getLoadedSLocEntry(MiddleIndex, &Invalid);		const SrcMgr::SLocEntry &E = getLoadedSLocEntry(MiddleIndex, &Invalid);
if (Invalid)		if (Invalid)
return FileID(); // invalid entry.		return FileID(); // invalid entry.

if (E.getOffset() > SLocOffset) {		if (E.getOffset() > SLocOffset) {
if (GreaterIndex == MiddleIndex) {		if (GreaterIndex == MiddleIndex) {
assert(0 && "binary search missed the entry");		llvm_unreachable("binary search missed the entry");
return FileID();		return FileID();
}		}
GreaterIndex = MiddleIndex;		GreaterIndex = MiddleIndex;
continue;		continue;
}		}

if (isOffsetInFileID(FileID::get(-int(MiddleIndex) - 2), SLocOffset)) {		if (isOffsetInFileID(FileID::get(-int(MiddleIndex) - 2), SLocOffset)) {
FileID Res = FileID::get(-int(MiddleIndex) - 2);		FileID Res = FileID::get(-int(MiddleIndex) - 2);
LastFileIDLookup = Res;		LastFileIDLookup = Res;
NumBinaryProbes += NumProbes;		NumBinaryProbes += NumProbes;
return Res;		return Res;
}		}

if (LessIndex == MiddleIndex) {		if (LessIndex == MiddleIndex) {
assert(0 && "binary search missed the entry");		llvm_unreachable("binary search missed the entry");
return FileID();		return FileID();
}		}
LessIndex = MiddleIndex;		LessIndex = MiddleIndex;
}		}
}		}

SourceLocation SourceManager::		SourceLocation SourceManager::
getExpansionLocSlowCase(SourceLocation Loc) const {		getExpansionLocSlowCase(SourceLocation Loc) const {
▲ Show 20 Lines • Show All 1,355 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/NVPTX.cpp

Show First 20 Lines • Show All 204 Lines • ▼ Show 20 Lines	std::string CUDAArchCode = [this] {
case CudaArch::GFX1101:		case CudaArch::GFX1101:
case CudaArch::GFX1102:		case CudaArch::GFX1102:
case CudaArch::GFX1103:		case CudaArch::GFX1103:
case CudaArch::Generic:		case CudaArch::Generic:
case CudaArch::LAST:		case CudaArch::LAST:
break;		break;
case CudaArch::UNUSED:		case CudaArch::UNUSED:
case CudaArch::UNKNOWN:		case CudaArch::UNKNOWN:
assert(false && "No GPU arch when compiling CUDA device code.");		llvm_unreachable("No GPU arch when compiling CUDA device code.");
return "";		return "";
case CudaArch::SM_20:		case CudaArch::SM_20:
return "200";		return "200";
case CudaArch::SM_21:		case CudaArch::SM_21:
return "210";		return "210";
case CudaArch::SM_30:		case CudaArch::SM_30:
return "300";		return "300";
case CudaArch::SM_32:		case CudaArch::SM_32:
▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGHLSLRuntime.cpp

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	void CGHLSLRuntime::annotateHLSLResource(const VarDecl D, GlobalVariable GV) {
uint32_t Counter = ResourceCounters[static_cast<uint32_t>(RC)]++;		uint32_t Counter = ResourceCounters[static_cast<uint32_t>(RC)]++;

NamedMDNode *ResourceMD = nullptr;		NamedMDNode *ResourceMD = nullptr;
switch (RC) {		switch (RC) {
case HLSLResourceAttr::ResourceClass::UAV:		case HLSLResourceAttr::ResourceClass::UAV:
ResourceMD = CGM.getModule().getOrInsertNamedMetadata("hlsl.uavs");		ResourceMD = CGM.getModule().getOrInsertNamedMetadata("hlsl.uavs");
break;		break;
default:		default:
assert(false && "Unsupported buffer type!");		llvm_unreachable("Unsupported buffer type!");
return;		return;
}		}

assert(ResourceMD != nullptr &&		assert(ResourceMD != nullptr &&
"ResourceMD must have been set by the switch above.");		"ResourceMD must have been set by the switch above.");

auto &Ctx = CGM.getModule().getContext();		auto &Ctx = CGM.getModule().getContext();
IRBuilder<> B(Ctx);		IRBuilder<> B(Ctx);
Show All 22 Lines
llvm::Value *CGHLSLRuntime::emitInputSemantic(IRBuilder<> &B,		llvm::Value *CGHLSLRuntime::emitInputSemantic(IRBuilder<> &B,
const ParmVarDecl &D) {		const ParmVarDecl &D) {
assert(D.hasAttrs() && "Entry parameter missing annotation attribute!");		assert(D.hasAttrs() && "Entry parameter missing annotation attribute!");
if (D.hasAttr<HLSLSV_GroupIndexAttr>()) {		if (D.hasAttr<HLSLSV_GroupIndexAttr>()) {
llvm::Function *DxGroupIndex =		llvm::Function *DxGroupIndex =
CGM.getIntrinsic(Intrinsic::dx_flattened_thread_id_in_group);		CGM.getIntrinsic(Intrinsic::dx_flattened_thread_id_in_group);
return B.CreateCall(FunctionCallee(DxGroupIndex));		return B.CreateCall(FunctionCallee(DxGroupIndex));
}		}
assert(false && "Unhandled parameter attribute");		llvm_unreachable("Unhandled parameter attribute");
return nullptr;		return nullptr;
}		}

void CGHLSLRuntime::emitEntryFunction(const FunctionDecl *FD,		void CGHLSLRuntime::emitEntryFunction(const FunctionDecl *FD,
llvm::Function *Fn) {		llvm::Function *Fn) {
llvm::Module &M = CGM.getModule();		llvm::Module &M = CGM.getModule();
llvm::LLVMContext &Ctx = M.getContext();		llvm::LLVMContext &Ctx = M.getContext();
auto *EntryTy = llvm::FunctionType::get(llvm::Type::getVoidTy(Ctx), false);		auto *EntryTy = llvm::FunctionType::get(llvm::Type::getVoidTy(Ctx), false);
▲ Show 20 Lines • Show All 89 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenModule.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,697 Lines • ▼ Show 20 Lines	if (FD->isTargetMultiVersion()) {
if (Version.startswith("arch="))		if (Version.startswith("arch="))
Architecture = Version.drop_front(sizeof("arch=") - 1);		Architecture = Version.drop_front(sizeof("arch=") - 1);
else if (Version != "default")		else if (Version != "default")
Feature.push_back(Version);		Feature.push_back(Version);

Options.emplace_back(cast<llvm::Function>(Func), Architecture, Feature);		Options.emplace_back(cast<llvm::Function>(Func), Architecture, Feature);
}		}
} else {		} else {
assert(0 && "Expected a target or target_clones multiversion function");		llvm_unreachable(
		"Expected a target or target_clones multiversion function");
continue;		continue;
}		}

llvm::Constant *ResolverConstant = GetOrCreateMultiVersionResolver(GD);		llvm::Constant *ResolverConstant = GetOrCreateMultiVersionResolver(GD);
if (auto *IFunc = dyn_cast<llvm::GlobalIFunc>(ResolverConstant))		if (auto *IFunc = dyn_cast<llvm::GlobalIFunc>(ResolverConstant))
ResolverConstant = IFunc->getResolver();		ResolverConstant = IFunc->getResolver();
llvm::Function *ResolverFunc = cast<llvm::Function>(ResolverConstant);		llvm::Function *ResolverFunc = cast<llvm::Function>(ResolverConstant);

▲ Show 20 Lines • Show All 3,380 Lines • Show Last 20 Lines

clang/lib/Driver/Multilib.cpp

Show First 20 Lines • Show All 271 Lines • ▼ Show 20 Lines	bool MultilibSet::select(const Multilib::flags_list &Flags, Multilib &M) const {
});		});

if (Filtered[0].priority() > Filtered[1].priority()) {		if (Filtered[0].priority() > Filtered[1].priority()) {
M = Filtered[0];		M = Filtered[0];
return true;		return true;
}		}

// TODO: We should consider returning llvm::Error rather than aborting.		// TODO: We should consider returning llvm::Error rather than aborting.
assert(false && "More than one multilib with the same priority");		llvm_unreachable("More than one multilib with the same priority");
return false;		return false;
}		}

LLVM_DUMP_METHOD void MultilibSet::dump() const {		LLVM_DUMP_METHOD void MultilibSet::dump() const {
print(llvm::errs());		print(llvm::errs());
}		}

void MultilibSet::print(raw_ostream &OS) const {		void MultilibSet::print(raw_ostream &OS) const {
Show All 19 Lines

clang/lib/Driver/ToolChains/Flang.cpp

Show First 20 Lines • Show All 106 Lines • ▼ Show 20 Lines	if (JA.getType() == types::TY_Nothing) {
JA.getType() == types::TY_LTO_IR) {		JA.getType() == types::TY_LTO_IR) {
CmdArgs.push_back("-emit-llvm");		CmdArgs.push_back("-emit-llvm");
} else if (JA.getType() == types::TY_LLVM_BC \|\|		} else if (JA.getType() == types::TY_LLVM_BC \|\|
JA.getType() == types::TY_LTO_BC) {		JA.getType() == types::TY_LTO_BC) {
CmdArgs.push_back("-emit-llvm-bc");		CmdArgs.push_back("-emit-llvm-bc");
} else if (JA.getType() == types::TY_PP_Asm) {		} else if (JA.getType() == types::TY_PP_Asm) {
CmdArgs.push_back("-S");		CmdArgs.push_back("-S");
} else {		} else {
assert(false && "Unexpected output type!");		llvm_unreachable("Unexpected output type!");
}		}
} else if (isa<AssembleJobAction>(JA)) {		} else if (isa<AssembleJobAction>(JA)) {
CmdArgs.push_back("-emit-obj");		CmdArgs.push_back("-emit-obj");
} else {		} else {
assert(false && "Unexpected action class for Flang tool.");		llvm_unreachable("Unexpected action class for Flang tool.");
}		}

const InputInfo &Input = Inputs[0];		const InputInfo &Input = Inputs[0];
types::ID InputType = Input.getType();		types::ID InputType = Input.getType();

// Add preprocessing options like -I, -D, etc. if we are using the		// Add preprocessing options like -I, -D, etc. if we are using the
// preprocessor (i.e. skip when dealing with e.g. binary files).		// preprocessor (i.e. skip when dealing with e.g. binary files).
if (types::getPreprocessedType(InputType) != types::TY_INVALID)		if (types::getPreprocessedType(InputType) != types::TY_INVALID)
▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

clang/lib/Format/Format.cpp

Show First 20 Lines • Show All 2,470 Lines • ▼ Show 20 Lines	while (Idx < Tokens.size()) {
auto SR = CharSourceRange::getCharRange(Tokens[St]->Tok.getLocation(),		auto SR = CharSourceRange::getCharRange(Tokens[St]->Tok.getLocation(),
Tokens[End]->Tok.getEndLoc());		Tokens[End]->Tok.getEndLoc());
auto Err =		auto Err =
Fixes.add(tooling::Replacement(Env.getSourceManager(), SR, ""));		Fixes.add(tooling::Replacement(Env.getSourceManager(), SR, ""));
// FIXME: better error handling. for now just print error message and skip		// FIXME: better error handling. for now just print error message and skip
// for the release version.		// for the release version.
if (Err) {		if (Err) {
llvm::errs() << llvm::toString(std::move(Err)) << "\n";		llvm::errs() << llvm::toString(std::move(Err)) << "\n";
assert(false && "Fixes must not conflict!");		llvm_unreachable("Fixes must not conflict!");
}		}
Idx = End + 1;		Idx = End + 1;
}		}

return Fixes;		return Fixes;
}		}

// Class for less-than inequality comparason for the set `RedundantTokens`.		// Class for less-than inequality comparason for the set `RedundantTokens`.
▲ Show 20 Lines • Show All 1,265 Lines • Show Last 20 Lines

clang/lib/Frontend/Rewrite/RewriteModernObjC.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,640 Lines • ▼ Show 20 Lines	bool RewriteModernObjC::RewriteObjCFieldDeclType(QualType &Type,
else if (Type->isRecordType()) {		else if (Type->isRecordType()) {
RecordDecl *RD = Type->castAs<RecordType>()->getDecl();		RecordDecl *RD = Type->castAs<RecordType>()->getDecl();
if (RD->isCompleteDefinition()) {		if (RD->isCompleteDefinition()) {
if (RD->isStruct())		if (RD->isStruct())
Result += "\n\tstruct ";		Result += "\n\tstruct ";
else if (RD->isUnion())		else if (RD->isUnion())
Result += "\n\tunion ";		Result += "\n\tunion ";
else		else
assert(false && "class not allowed as an ivar type");		llvm_unreachable("class not allowed as an ivar type");

Result += RD->getName();		Result += RD->getName();
if (GlobalDefinedTags.count(RD)) {		if (GlobalDefinedTags.count(RD)) {
// struct/union is defined globally, use it.		// struct/union is defined globally, use it.
Result += " ";		Result += " ";
return true;		return true;
}		}
Result += " {\n";		Result += " {\n";
▲ Show 20 Lines • Show All 917 Lines • ▼ Show 20 Lines	ConditionalOperator *CondExpr = new (Context) ConditionalOperator(
cast<Expr>(RHSStmt), Exp->getType(), VK_PRValue, OK_Ordinary);		cast<Expr>(RHSStmt), Exp->getType(), VK_PRValue, OK_Ordinary);
return CondExpr;		return CondExpr;
} else if (const ObjCIvarRefExpr *IRE = dyn_cast<ObjCIvarRefExpr>(BlockExp)) {		} else if (const ObjCIvarRefExpr *IRE = dyn_cast<ObjCIvarRefExpr>(BlockExp)) {
CPT = IRE->getType()->getAs<BlockPointerType>();		CPT = IRE->getType()->getAs<BlockPointerType>();
} else if (const PseudoObjectExpr *POE		} else if (const PseudoObjectExpr *POE
= dyn_cast<PseudoObjectExpr>(BlockExp)) {		= dyn_cast<PseudoObjectExpr>(BlockExp)) {
CPT = POE->getType()->castAs<BlockPointerType>();		CPT = POE->getType()->castAs<BlockPointerType>();
} else {		} else {
assert(false && "RewriteBlockClass: Bad type");		llvm_unreachable("RewriteBlockClass: Bad type");
}		}
assert(CPT && "RewriteBlockClass: Bad type");		assert(CPT && "RewriteBlockClass: Bad type");
const FunctionType *FT = CPT->getPointeeType()->getAs<FunctionType>();		const FunctionType *FT = CPT->getPointeeType()->getAs<FunctionType>();
assert(FT && "RewriteBlockClass: Bad type");		assert(FT && "RewriteBlockClass: Bad type");
const FunctionProtoType *FTP = dyn_cast<FunctionProtoType>(FT);		const FunctionProtoType *FTP = dyn_cast<FunctionProtoType>(FT);
// FTP will be null for closures that don't take arguments.		// FTP will be null for closures that don't take arguments.

RecordDecl RD = RecordDecl::Create(Context, TTK_Struct, TUDecl,		RecordDecl RD = RecordDecl::Create(Context, TTK_Struct, TUDecl,
▲ Show 20 Lines • Show All 2,978 Lines • Show Last 20 Lines

clang/lib/Frontend/Rewrite/RewriteObjC.cpp

Show First 20 Lines • Show All 3,741 Lines • ▼ Show 20 Lines	ConditionalOperator *CondExpr = new (Context) ConditionalOperator(
cast<Expr>(RHSStmt), Exp->getType(), VK_PRValue, OK_Ordinary);		cast<Expr>(RHSStmt), Exp->getType(), VK_PRValue, OK_Ordinary);
return CondExpr;		return CondExpr;
} else if (const ObjCIvarRefExpr *IRE = dyn_cast<ObjCIvarRefExpr>(BlockExp)) {		} else if (const ObjCIvarRefExpr *IRE = dyn_cast<ObjCIvarRefExpr>(BlockExp)) {
CPT = IRE->getType()->getAs<BlockPointerType>();		CPT = IRE->getType()->getAs<BlockPointerType>();
} else if (const PseudoObjectExpr *POE		} else if (const PseudoObjectExpr *POE
= dyn_cast<PseudoObjectExpr>(BlockExp)) {		= dyn_cast<PseudoObjectExpr>(BlockExp)) {
CPT = POE->getType()->castAs<BlockPointerType>();		CPT = POE->getType()->castAs<BlockPointerType>();
} else {		} else {
assert(false && "RewriteBlockClass: Bad type");		llvm_unreachable("RewriteBlockClass: Bad type");
}		}
assert(CPT && "RewriteBlockClass: Bad type");		assert(CPT && "RewriteBlockClass: Bad type");
const FunctionType *FT = CPT->getPointeeType()->getAs<FunctionType>();		const FunctionType *FT = CPT->getPointeeType()->getAs<FunctionType>();
assert(FT && "RewriteBlockClass: Bad type");		assert(FT && "RewriteBlockClass: Bad type");
const FunctionProtoType *FTP = dyn_cast<FunctionProtoType>(FT);		const FunctionProtoType *FTP = dyn_cast<FunctionProtoType>(FT);
// FTP will be null for closures that don't take arguments.		// FTP will be null for closures that don't take arguments.

RecordDecl RD = RecordDecl::Create(Context, TTK_Struct, TUDecl,		RecordDecl RD = RecordDecl::Create(Context, TTK_Struct, TUDecl,
▲ Show 20 Lines • Show All 2,129 Lines • Show Last 20 Lines

clang/lib/Frontend/SARIFDiagnostic.cpp

Show First 20 Lines • Show All 149 Lines • ▼ Show 20 Lines	case DiagnosticsEngine::Warning:
break;		break;
case DiagnosticsEngine::Error:		case DiagnosticsEngine::Error:
Config = Config.setLevel(SarifResultLevel::Error).setRank(50);		Config = Config.setLevel(SarifResultLevel::Error).setRank(50);
break;		break;
case DiagnosticsEngine::Fatal:		case DiagnosticsEngine::Fatal:
Config = Config.setLevel(SarifResultLevel::Error).setRank(100);		Config = Config.setLevel(SarifResultLevel::Error).setRank(100);
break;		break;
case DiagnosticsEngine::Ignored:		case DiagnosticsEngine::Ignored:
assert(false && "Invalid diagnostic type");		llvm_unreachable("Invalid diagnostic type");
}		}

return Rule.setDefaultConfiguration(Config);		return Rule.setDefaultConfiguration(Config);
}		}

llvm::StringRef SARIFDiagnostic::emitFilename(StringRef Filename,		llvm::StringRef SARIFDiagnostic::emitFilename(StringRef Filename,
const SourceManager &SM) {		const SourceManager &SM) {
if (DiagOpts->AbsolutePath) {		if (DiagOpts->AbsolutePath) {
Show All 33 Lines
///		///
/// This method handlen the emission of the diagnostic location information.		/// This method handlen the emission of the diagnostic location information.
/// This includes extracting as much location information as is present for		/// This includes extracting as much location information as is present for
/// the diagnostic and printing it, as well as any include stack or source		/// the diagnostic and printing it, as well as any include stack or source
/// ranges necessary.		/// ranges necessary.
void SARIFDiagnostic::emitDiagnosticLoc(FullSourceLoc Loc, PresumedLoc PLoc,		void SARIFDiagnostic::emitDiagnosticLoc(FullSourceLoc Loc, PresumedLoc PLoc,
DiagnosticsEngine::Level Level,		DiagnosticsEngine::Level Level,
ArrayRef<CharSourceRange> Ranges) {		ArrayRef<CharSourceRange> Ranges) {
assert(false && "Not implemented in SARIF mode");		llvm_unreachable("Not implemented in SARIF mode");
}		}

void SARIFDiagnostic::emitIncludeLocation(FullSourceLoc Loc, PresumedLoc PLoc) {		void SARIFDiagnostic::emitIncludeLocation(FullSourceLoc Loc, PresumedLoc PLoc) {
assert(false && "Not implemented in SARIF mode");		llvm_unreachable("Not implemented in SARIF mode");
}		}

void SARIFDiagnostic::emitImportLocation(FullSourceLoc Loc, PresumedLoc PLoc,		void SARIFDiagnostic::emitImportLocation(FullSourceLoc Loc, PresumedLoc PLoc,
StringRef ModuleName) {		StringRef ModuleName) {
assert(false && "Not implemented in SARIF mode");		llvm_unreachable("Not implemented in SARIF mode");
}		}

void SARIFDiagnostic::emitBuildingModuleLocation(FullSourceLoc Loc,		void SARIFDiagnostic::emitBuildingModuleLocation(FullSourceLoc Loc,
PresumedLoc PLoc,		PresumedLoc PLoc,
StringRef ModuleName) {		StringRef ModuleName) {
assert(false && "Not implemented in SARIF mode");		llvm_unreachable("Not implemented in SARIF mode");
}		}
} // namespace clang		} // namespace clang

clang/lib/Lex/PPDirectives.cpp

Show First 20 Lines • Show All 3,480 Lines • ▼ Show 20 Lines	case tok::pp_elif:
break;		break;
case tok::pp_elifdef:		case tok::pp_elifdef:
Callbacks->Elifdef(ElifToken.getLocation(), ConditionRange, CI.IfLoc);		Callbacks->Elifdef(ElifToken.getLocation(), ConditionRange, CI.IfLoc);
break;		break;
case tok::pp_elifndef:		case tok::pp_elifndef:
Callbacks->Elifndef(ElifToken.getLocation(), ConditionRange, CI.IfLoc);		Callbacks->Elifndef(ElifToken.getLocation(), ConditionRange, CI.IfLoc);
break;		break;
default:		default:
assert(false && "unexpected directive kind");		llvm_unreachable("unexpected directive kind");
break;		break;
}		}
}		}

bool RetainExcludedCB = PPOpts->RetainExcludedConditionalBlocks &&		bool RetainExcludedCB = PPOpts->RetainExcludedConditionalBlocks &&
getSourceManager().isInMainFile(ElifToken.getLocation());		getSourceManager().isInMainFile(ElifToken.getLocation());

if ((PPOpts->SingleFileParseMode && !CI.FoundNonSkip) \|\| RetainExcludedCB) {		if ((PPOpts->SingleFileParseMode && !CI.FoundNonSkip) \|\| RetainExcludedCB) {
Show All 12 Lines

clang/lib/Lex/PreprocessingRecord.cpp

	Show First 20 Lines • Show All 96 Lines • ▼ Show 20 Lines
	/// \see getPreprocessedEntitiesInRange.			/// \see getPreprocessedEntitiesInRange.
	bool PreprocessingRecord::isEntityInFileID(iterator PPEI, FileID FID) {			bool PreprocessingRecord::isEntityInFileID(iterator PPEI, FileID FID) {
	if (FID.isInvalid())			if (FID.isInvalid())
	return false;			return false;

	int Pos = std::distance(iterator(this, 0), PPEI);			int Pos = std::distance(iterator(this, 0), PPEI);
	if (Pos < 0) {			if (Pos < 0) {
	if (unsigned(-Pos-1) >= LoadedPreprocessedEntities.size()) {			if (unsigned(-Pos-1) >= LoadedPreprocessedEntities.size()) {
	assert(0 && "Out-of bounds loaded preprocessed entity");			llvm_unreachable("Out-of bounds loaded preprocessed entity");
	return false;			return false;
	}			}
	assert(ExternalSource && "No external source to load from");			assert(ExternalSource && "No external source to load from");
	unsigned LoadedIndex = LoadedPreprocessedEntities.size()+Pos;			unsigned LoadedIndex = LoadedPreprocessedEntities.size()+Pos;
	if (PreprocessedEntity *PPE = LoadedPreprocessedEntities[LoadedIndex])			if (PreprocessedEntity *PPE = LoadedPreprocessedEntities[LoadedIndex])
	return isPreprocessedEntityIfInFileID(PPE, FID, SourceMgr);			return isPreprocessedEntityIfInFileID(PPE, FID, SourceMgr);

	// See if the external source can see if the entity is in the file without			// See if the external source can see if the entity is in the file without
	// deserializing it.			// deserializing it.
	Optional<bool> IsInFile =			Optional<bool> IsInFile =
	ExternalSource->isPreprocessedEntityInFileID(LoadedIndex, FID);			ExternalSource->isPreprocessedEntityInFileID(LoadedIndex, FID);
	if (IsInFile)			if (IsInFile)
	return IsInFile.value();			return IsInFile.value();

	// The external source did not provide a definite answer, go and deserialize			// The external source did not provide a definite answer, go and deserialize
	// the entity to check it.			// the entity to check it.
	return isPreprocessedEntityIfInFileID(			return isPreprocessedEntityIfInFileID(
	getLoadedPreprocessedEntity(LoadedIndex),			getLoadedPreprocessedEntity(LoadedIndex),
	FID, SourceMgr);			FID, SourceMgr);
	}			}

	if (unsigned(Pos) >= PreprocessedEntities.size()) {			if (unsigned(Pos) >= PreprocessedEntities.size()) {
	assert(0 && "Out-of bounds local preprocessed entity");			llvm_unreachable("Out-of bounds local preprocessed entity");
	return false;			return false;
	}			}
	return isPreprocessedEntityIfInFileID(PreprocessedEntities[Pos],			return isPreprocessedEntityIfInFileID(PreprocessedEntities[Pos],
	FID, SourceMgr);			FID, SourceMgr);
	}			}

	/// Returns a pair of [Begin, End) iterators of preprocessed entities			/// Returns a pair of [Begin, End) iterators of preprocessed entities
	/// that source range \arg R encompasses.			/// that source range \arg R encompasses.
	▲ Show 20 Lines • Show All 398 Lines • Show Last 20 Lines

clang/lib/Parse/ParseDecl.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,552 Lines • ▼ Show 20 Lines	ParseIdentifier: {
// attribute declaration and continue.		// attribute declaration and continue.
if (NextToken().is(tok::l_paren)) {		if (NextToken().is(tok::l_paren)) {
// Consume the __declspec identifier.		// Consume the __declspec identifier.
ConsumeToken();		ConsumeToken();

// Eat the parens and everything between them.		// Eat the parens and everything between them.
BalancedDelimiterTracker T(*this, tok::l_paren);		BalancedDelimiterTracker T(*this, tok::l_paren);
if (T.consumeOpen()) {		if (T.consumeOpen()) {
assert(false && "Not a left paren?");		llvm_unreachable("Not a left paren?");
return;		return;
}		}
T.skipToEnd();		T.skipToEnd();
continue;		continue;
}		}
}		}

// In C++, check to see if this is a scope specifier like foo::bar::, if		// In C++, check to see if this is a scope specifier like foo::bar::, if
▲ Show 20 Lines • Show All 4,208 Lines • Show Last 20 Lines

clang/lib/Sema/SemaChecking.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 8,104 Lines • ▼ Show 20 Lines	if (BuiltinID == AArch64::BI__builtin_arm_subp) {
if (isNull(ArgB))		if (isNull(ArgB))
ArgExprB = ImpCastExprToType(ArgExprB.get(), ArgTypeA, CK_NullToPointer);		ArgExprB = ImpCastExprToType(ArgExprB.get(), ArgTypeA, CK_NullToPointer);

TheCall->setArg(0, ArgExprA.get());		TheCall->setArg(0, ArgExprA.get());
TheCall->setArg(1, ArgExprB.get());		TheCall->setArg(1, ArgExprB.get());
TheCall->setType(Context.LongLongTy);		TheCall->setType(Context.LongLongTy);
return false;		return false;
}		}
assert(false && "Unhandled ARM MTE intrinsic");		llvm_unreachable("Unhandled ARM MTE intrinsic");
return true;		return true;
}		}

/// SemaBuiltinARMSpecialReg - Handle a check if argument ArgNum of CallExpr		/// SemaBuiltinARMSpecialReg - Handle a check if argument ArgNum of CallExpr
/// TheCall is an ARM/AArch64 special register string literal.		/// TheCall is an ARM/AArch64 special register string literal.
bool Sema::SemaBuiltinARMSpecialReg(unsigned BuiltinID, CallExpr *TheCall,		bool Sema::SemaBuiltinARMSpecialReg(unsigned BuiltinID, CallExpr *TheCall,
int ArgNum, unsigned ExpectedFieldNum,		int ArgNum, unsigned ExpectedFieldNum,
bool AllowName) {		bool AllowName) {
▲ Show 20 Lines • Show All 9,762 Lines • Show Last 20 Lines

clang/lib/Sema/SemaCodeComplete.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 542 Lines • ▼ Show 20 Lines	case tok::plusplus:
if (ContextType.isNull())		if (ContextType.isNull())
return S.getASTContext().IntTy;		return S.getASTContext().IntTy;
// leave as is, these operators typically return the same type.		// leave as is, these operators typically return the same type.
return ContextType;		return ContextType;
case tok::kw___real:		case tok::kw___real:
case tok::kw___imag:		case tok::kw___imag:
return QualType();		return QualType();
default:		default:
assert(false && "unhandled unary op");		llvm_unreachable("unhandled unary op");
return QualType();		return QualType();
}		}
}		}

void PreferredTypeBuilder::enterBinary(Sema &S, SourceLocation Tok, Expr *LHS,		void PreferredTypeBuilder::enterBinary(Sema &S, SourceLocation Tok, Expr *LHS,
tok::TokenKind Op) {		tok::TokenKind Op) {
if (!Enabled)		if (!Enabled)
return;		return;
▲ Show 20 Lines • Show All 9,524 Lines • Show Last 20 Lines

clang/lib/Serialization/ASTReader.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,458 Lines • ▼ Show 20 Lines	if (D) {
Merged.push_back(ID);		Merged.push_back(ID);
}		}
return D;		return D;
}		}

unsigned Index = ID - NUM_PREDEF_DECL_IDS;		unsigned Index = ID - NUM_PREDEF_DECL_IDS;

if (Index >= DeclsLoaded.size()) {		if (Index >= DeclsLoaded.size()) {
assert(0 && "declaration ID out-of-range for AST file");		llvm_unreachable("declaration ID out-of-range for AST file");
Error("declaration ID out-of-range for AST file");		Error("declaration ID out-of-range for AST file");
return nullptr;		return nullptr;
}		}

return DeclsLoaded[Index];		return DeclsLoaded[Index];
}		}

Decl *ASTReader::GetDecl(DeclID ID) {		Decl *ASTReader::GetDecl(DeclID ID) {
if (ID < NUM_PREDEF_DECL_IDS)		if (ID < NUM_PREDEF_DECL_IDS)
return GetExistingDecl(ID);		return GetExistingDecl(ID);

unsigned Index = ID - NUM_PREDEF_DECL_IDS;		unsigned Index = ID - NUM_PREDEF_DECL_IDS;

if (Index >= DeclsLoaded.size()) {		if (Index >= DeclsLoaded.size()) {
assert(0 && "declaration ID out-of-range for AST file");		llvm_unreachable("declaration ID out-of-range for AST file");
Error("declaration ID out-of-range for AST file");		Error("declaration ID out-of-range for AST file");
return nullptr;		return nullptr;
}		}

if (!DeclsLoaded[Index]) {		if (!DeclsLoaded[Index]) {
ReadDeclRecord(ID);		ReadDeclRecord(ID);
if (DeserializationListener)		if (DeserializationListener)
DeserializationListener->DeclRead(ID, DeclsLoaded[Index]);		DeserializationListener->DeclRead(ID, DeclsLoaded[Index]);
▲ Show 20 Lines • Show All 3,688 Lines • Show Last 20 Lines

clang/lib/Serialization/ASTWriter.cpp

Show First 20 Lines • Show All 2,432 Lines • ▼ Show 20 Lines	void ASTWriter::WritePreprocessor(const Preprocessor &PP, bool IsModule) {
std::vector<uint32_t> MacroOffsets;		std::vector<uint32_t> MacroOffsets;

for (unsigned I = 0, N = MacroInfosToEmit.size(); I != N; ++I) {		for (unsigned I = 0, N = MacroInfosToEmit.size(); I != N; ++I) {
const IdentifierInfo *Name = MacroInfosToEmit[I].Name;		const IdentifierInfo *Name = MacroInfosToEmit[I].Name;
MacroInfo *MI = MacroInfosToEmit[I].MI;		MacroInfo *MI = MacroInfosToEmit[I].MI;
MacroID ID = MacroInfosToEmit[I].ID;		MacroID ID = MacroInfosToEmit[I].ID;

if (ID < FirstMacroID) {		if (ID < FirstMacroID) {
assert(0 && "Loaded MacroInfo entered MacroInfosToEmit ?");		llvm_unreachable("Loaded MacroInfo entered MacroInfosToEmit ?");
continue;		continue;
}		}

// Record the local offset of this macro.		// Record the local offset of this macro.
unsigned Index = ID - FirstMacroID;		unsigned Index = ID - FirstMacroID;
if (Index >= MacroOffsets.size())		if (Index >= MacroOffsets.size())
MacroOffsets.resize(Index + 1);		MacroOffsets.resize(Index + 1);

▲ Show 20 Lines • Show All 2,926 Lines • ▼ Show 20 Lines	TypeID ASTWriter::GetOrCreateTypeID(QualType T) {
return MakeTypeID(*Context, T, [&](QualType T) -> TypeIdx {		return MakeTypeID(*Context, T, [&](QualType T) -> TypeIdx {
if (T.isNull())		if (T.isNull())
return TypeIdx();		return TypeIdx();
assert(!T.getLocalFastQualifiers());		assert(!T.getLocalFastQualifiers());

TypeIdx &Idx = TypeIdxs[T];		TypeIdx &Idx = TypeIdxs[T];
if (Idx.getIndex() == 0) {		if (Idx.getIndex() == 0) {
if (DoneWritingDeclsAndTypes) {		if (DoneWritingDeclsAndTypes) {
assert(0 && "New type seen after serializing all the types to emit!");		llvm_unreachable(
		"New type seen after serializing all the types to emit!");
return TypeIdx();		return TypeIdx();
}		}

// We haven't seen this type before. Assign it a new ID and put it		// We haven't seen this type before. Assign it a new ID and put it
// into the queue of types to emit.		// into the queue of types to emit.
Idx = TypeIdx(NextTypeID++);		Idx = TypeIdx(NextTypeID++);
DeclTypesToEmit.push(T);		DeclTypesToEmit.push(T);
}		}
Show All 29 Lines	DeclID ASTWriter::GetDeclRef(const Decl *D) {
// fixed.		// fixed.
if (D->isFromASTFile())		if (D->isFromASTFile())
return D->getGlobalID();		return D->getGlobalID();

assert(!(reinterpret_cast<uintptr_t>(D) & 0x01) && "Invalid decl pointer");		assert(!(reinterpret_cast<uintptr_t>(D) & 0x01) && "Invalid decl pointer");
DeclID &ID = DeclIDs[D];		DeclID &ID = DeclIDs[D];
if (ID == 0) {		if (ID == 0) {
if (DoneWritingDeclsAndTypes) {		if (DoneWritingDeclsAndTypes) {
assert(0 && "New decl seen after serializing all the decls to emit!");		llvm_unreachable(
		"New decl seen after serializing all the decls to emit!");
return 0;		return 0;
}		}

// We haven't seen this declaration before. Give it a new ID and		// We haven't seen this declaration before. Give it a new ID and
// enqueue it in the list of declarations to emit.		// enqueue it in the list of declarations to emit.
ID = NextDeclID++;		ID = NextDeclID++;
DeclTypesToEmit.push(const_cast<Decl *>(D));		DeclTypesToEmit.push(const_cast<Decl *>(D));
}		}
▲ Show 20 Lines • Show All 1,512 Lines • Show Last 20 Lines

clang/lib/StaticAnalyzer/Core/CoreEngine.cpp

Show First 20 Lines • Show All 186 Lines • ▼ Show 20 Lines	case ProgramPoint::BlockEdgeKind:
HandleBlockEdge(Loc.castAs<BlockEdge>(), Pred);		HandleBlockEdge(Loc.castAs<BlockEdge>(), Pred);
break;		break;

case ProgramPoint::BlockEntranceKind:		case ProgramPoint::BlockEntranceKind:
HandleBlockEntrance(Loc.castAs<BlockEntrance>(), Pred);		HandleBlockEntrance(Loc.castAs<BlockEntrance>(), Pred);
break;		break;

case ProgramPoint::BlockExitKind:		case ProgramPoint::BlockExitKind:
assert(false && "BlockExit location never occur in forward analysis.");		llvm_unreachable("BlockExit location never occur in forward analysis.");
break;		break;

case ProgramPoint::CallEnterKind:		case ProgramPoint::CallEnterKind:
HandleCallEnter(Loc.castAs<CallEnter>(), Pred);		HandleCallEnter(Loc.castAs<CallEnter>(), Pred);
break;		break;

case ProgramPoint::CallExitBeginKind:		case ProgramPoint::CallExitBeginKind:
ExprEng.processCallExit(Pred);		ExprEng.processCallExit(Pred);
▲ Show 20 Lines • Show All 533 Lines • Show Last 20 Lines

clang/lib/StaticAnalyzer/Core/SVals.cpp

Show First 20 Lines • Show All 349 Lines • ▼ Show 20 Lines	case nonloc::PointerToMemberKind: {

os << I->getType();		os << I->getType();
}		}

os << '}';		os << '}';
break;		break;
}		}
default:		default:
assert(false && "Pretty-printed not implemented for this NonLoc.");		llvm_unreachable("Pretty-printed not implemented for this NonLoc.");
break;		break;
}		}
}		}

void Loc::dumpToStream(raw_ostream &os) const {		void Loc::dumpToStream(raw_ostream &os) const {
switch (getSubKind()) {		switch (getSubKind()) {
case loc::ConcreteIntKind:		case loc::ConcreteIntKind:
os << castAs<loc::ConcreteInt>().getValue().getZExtValue() << " (Loc)";		os << castAs<loc::ConcreteInt>().getValue().getZExtValue() << " (Loc)";
Show All 11 Lines

clang/tools/clang-refactor/TestSupport.cpp

Show First 20 Lines • Show All 342 Lines • ▼ Show 20 Lines	if (!RangeRegex.match(Comment, &Matches) \|\| Comment.contains("CHECK")) {
return None;		return None;
continue;		continue;
}		}
unsigned Offset = Tok.getEndLoc().getRawEncoding();		unsigned Offset = Tok.getEndLoc().getRawEncoding();
unsigned ColumnOffset = 0;		unsigned ColumnOffset = 0;
if (!Matches[2].empty()) {		if (!Matches[2].empty()) {
// Don't forget to drop the '+'!		// Don't forget to drop the '+'!
if (Matches[2].drop_front().getAsInteger(10, ColumnOffset))		if (Matches[2].drop_front().getAsInteger(10, ColumnOffset))
assert(false && "regex should have produced a number");		llvm_unreachable("regex should have produced a number");
}		}
Offset = addColumnOffset(Source, Offset, ColumnOffset);		Offset = addColumnOffset(Source, Offset, ColumnOffset);
unsigned EndOffset;		unsigned EndOffset;

if (!Matches[3].empty()) {		if (!Matches[3].empty()) {
static const Regex EndLocRegex(		static const Regex EndLocRegex(
"->[[:blank:]]*(\\+[[:digit:]]+):([[:digit:]]+)");		"->[[:blank:]]*(\\+[[:digit:]]+):([[:digit:]]+)");
SmallVector<StringRef, 4> EndLocMatches;		SmallVector<StringRef, 4> EndLocMatches;
if (!EndLocRegex.match(Matches[3], &EndLocMatches)) {		if (!EndLocRegex.match(Matches[3], &EndLocMatches)) {
if (DetectMistypedCommand())		if (DetectMistypedCommand())
return None;		return None;
continue;		continue;
}		}
unsigned EndLineOffset = 0, EndColumn = 0;		unsigned EndLineOffset = 0, EndColumn = 0;
if (EndLocMatches[1].drop_front().getAsInteger(10, EndLineOffset) \|\|		if (EndLocMatches[1].drop_front().getAsInteger(10, EndLineOffset) \|\|
EndLocMatches[2].getAsInteger(10, EndColumn))		EndLocMatches[2].getAsInteger(10, EndColumn))
assert(false && "regex should have produced a number");		llvm_unreachable("regex should have produced a number");
EndOffset = addEndLineOffsetAndEndColumn(Source, Offset, EndLineOffset,		EndOffset = addEndLineOffsetAndEndColumn(Source, Offset, EndLineOffset,
EndColumn);		EndColumn);
} else {		} else {
EndOffset = Offset;		EndOffset = Offset;
}		}
TestSelectionRange Range = {Offset, EndOffset};		TestSelectionRange Range = {Offset, EndOffset};
auto It = GroupedRanges.insert(std::make_pair(		auto It = GroupedRanges.insert(std::make_pair(
Matches[1].str(), SmallVector<TestSelectionRange, 8>{Range}));		Matches[1].str(), SmallVector<TestSelectionRange, 8>{Range}));
Show All 17 Lines

clang/tools/libclang/CIndex.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 193 Lines • ▼ Show 20 Lines
/// should continue.		/// should continue.
bool CursorVisitor::Visit(CXCursor Cursor, bool CheckedRegionOfInterest) {		bool CursorVisitor::Visit(CXCursor Cursor, bool CheckedRegionOfInterest) {
if (clang_isInvalid(Cursor.kind))		if (clang_isInvalid(Cursor.kind))
return false;		return false;

if (clang_isDeclaration(Cursor.kind)) {		if (clang_isDeclaration(Cursor.kind)) {
const Decl *D = getCursorDecl(Cursor);		const Decl *D = getCursorDecl(Cursor);
if (!D) {		if (!D) {
assert(0 && "Invalid declaration cursor");		llvm_unreachable("Invalid declaration cursor");
return true; // abort.		return true; // abort.
}		}

// Ignore implicit declarations, unless it's an objc method because		// Ignore implicit declarations, unless it's an objc method because
// currently we should report implicit methods for properties when indexing.		// currently we should report implicit methods for properties when indexing.
if (D->isImplicit() && !isa<ObjCMethodDecl>(D))		if (D->isImplicit() && !isa<ObjCMethodDecl>(D))
return false;		return false;
}		}
▲ Show 20 Lines • Show All 4,972 Lines • ▼ Show 20 Lines	clang_PrintingPolicy_getProperty(CXPrintingPolicy Policy,
case CXPrintingPolicy_ConstantsAsWritten:		case CXPrintingPolicy_ConstantsAsWritten:
return P->ConstantsAsWritten;		return P->ConstantsAsWritten;
case CXPrintingPolicy_SuppressImplicitBase:		case CXPrintingPolicy_SuppressImplicitBase:
return P->SuppressImplicitBase;		return P->SuppressImplicitBase;
case CXPrintingPolicy_FullyQualifiedName:		case CXPrintingPolicy_FullyQualifiedName:
return P->FullyQualifiedName;		return P->FullyQualifiedName;
}		}

assert(false && "Invalid CXPrintingPolicyProperty");		llvm_unreachable("Invalid CXPrintingPolicyProperty");
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions This one is a bit questionable -- this is part of the C interface we expose, which is ABI stable, so the assert was alerting users to potential mismatches between versions of the library. aaron.ballman: This one is a bit questionable -- this is part of the C interface we expose, which is ABI…
return 0;		return 0;
}		}

void clang_PrintingPolicy_setProperty(CXPrintingPolicy Policy,		void clang_PrintingPolicy_setProperty(CXPrintingPolicy Policy,
enum CXPrintingPolicyProperty Property,		enum CXPrintingPolicyProperty Property,
unsigned Value) {		unsigned Value) {
if (!Policy)		if (!Policy)
return;		return;
▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	void clang_PrintingPolicy_setProperty(CXPrintingPolicy Policy,
case CXPrintingPolicy_SuppressImplicitBase:		case CXPrintingPolicy_SuppressImplicitBase:
P->SuppressImplicitBase = Value;		P->SuppressImplicitBase = Value;
return;		return;
case CXPrintingPolicy_FullyQualifiedName:		case CXPrintingPolicy_FullyQualifiedName:
P->FullyQualifiedName = Value;		P->FullyQualifiedName = Value;
return;		return;
}		}

assert(false && "Invalid CXPrintingPolicyProperty");		llvm_unreachable("Invalid CXPrintingPolicyProperty");
}		}

CXString clang_getCursorPrettyPrinted(CXCursor C, CXPrintingPolicy cxPolicy) {		CXString clang_getCursorPrettyPrinted(CXCursor C, CXPrintingPolicy cxPolicy) {
if (clang_Cursor_isNull(C))		if (clang_Cursor_isNull(C))
return cxstring::createEmpty();		return cxstring::createEmpty();

if (clang_isDeclaration(C.kind)) {		if (clang_isDeclaration(C.kind)) {
const Decl *D = getCursorDecl(C);		const Decl *D = getCursorDecl(C);
▲ Show 20 Lines • Show All 4,177 Lines • Show Last 20 Lines

clang/tools/libclang/CXCursor.cpp

Show First 20 Lines • Show All 1,481 Lines • ▼ Show 20 Lines	CXType clang_Cursor_getTemplateArgumentType(CXCursor C, unsigned I) {

return cxtype::MakeCXType(TA.getAsType(), getCursorTU(C));		return cxtype::MakeCXType(TA.getAsType(), getCursorTU(C));
}		}

long long clang_Cursor_getTemplateArgumentValue(CXCursor C, unsigned I) {		long long clang_Cursor_getTemplateArgumentValue(CXCursor C, unsigned I) {
TemplateArgument TA;		TemplateArgument TA;
if (clang_Cursor_getTemplateArgument(C, I, &TA) !=		if (clang_Cursor_getTemplateArgument(C, I, &TA) !=
CXGetTemplateArgumentStatus_Success) {		CXGetTemplateArgumentStatus_Success) {
assert(0 && "Unable to retrieve TemplateArgument");		llvm_unreachable("Unable to retrieve TemplateArgument");
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Each of these is actually reachable -- the asserts exist specifically to tell users of the C interface about problems with their assumptions. In each of these cases, the assert is avoiding the need for a local variable to assert on. aaron.ballman: Each of these is actually reachable -- the asserts exist specifically to tell users of the C…
return 0;		return 0;
}		}

if (TA.getKind() != TemplateArgument::Integral) {		if (TA.getKind() != TemplateArgument::Integral) {
assert(0 && "Passed template argument is not Integral");		llvm_unreachable("Passed template argument is not Integral");
return 0;		return 0;
}		}

return TA.getAsIntegral().getSExtValue();		return TA.getAsIntegral().getSExtValue();
}		}

unsigned long long clang_Cursor_getTemplateArgumentUnsignedValue(CXCursor C,		unsigned long long clang_Cursor_getTemplateArgumentUnsignedValue(CXCursor C,
unsigned I) {		unsigned I) {
TemplateArgument TA;		TemplateArgument TA;
if (clang_Cursor_getTemplateArgument(C, I, &TA) !=		if (clang_Cursor_getTemplateArgument(C, I, &TA) !=
CXGetTemplateArgumentStatus_Success) {		CXGetTemplateArgumentStatus_Success) {
assert(0 && "Unable to retrieve TemplateArgument");		llvm_unreachable("Unable to retrieve TemplateArgument");
return 0;		return 0;
}		}

if (TA.getKind() != TemplateArgument::Integral) {		if (TA.getKind() != TemplateArgument::Integral) {
assert(0 && "Passed template argument is not Integral");		llvm_unreachable("Passed template argument is not Integral");
return 0;		return 0;
}		}

return TA.getAsIntegral().getZExtValue();		return TA.getAsIntegral().getZExtValue();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// CXCursorSet.		// CXCursorSet.
▲ Show 20 Lines • Show All 257 Lines • Show Last 20 Lines

clang/utils/TableGen/ClangSyntaxEmitter.cpp

Show First 20 Lines • Show All 111 Lines • ▼ Show 20 Lines	struct SyntaxConstraint {
SyntaxConstraint(const llvm::Record &R) {		SyntaxConstraint(const llvm::Record &R) {
if (R.isSubClassOf("Optional")) {		if (R.isSubClassOf("Optional")) {
this = SyntaxConstraint(R.getValueAsDef("inner"));		this = SyntaxConstraint(R.getValueAsDef("inner"));
} else if (R.isSubClassOf("AnyToken")) {		} else if (R.isSubClassOf("AnyToken")) {
NodeType = "Leaf";		NodeType = "Leaf";
} else if (R.isSubClassOf("NodeType")) {		} else if (R.isSubClassOf("NodeType")) {
NodeType = R.getName().str();		NodeType = R.getName().str();
} else {		} else {
assert(false && "Unhandled Syntax kind");		llvm_unreachable("Unhandled Syntax kind");
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions This should not be using unreachable -- the code is very much reachable. This should have changed from `assert` to `PrintFatalError`. aaron.ballman: This should not be using unreachable -- the code is very much reachable. This should have…
}		}
}		}

std::string NodeType;		std::string NodeType;
// optional and leaf types also go here, once we want to use them.		// optional and leaf types also go here, once we want to use them.
};		};

} // namespace		} // namespace
▲ Show 20 Lines • Show All 108 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[clang] replace `assert(0)` with `llvm_unreachable` NFCNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 466471

clang/include/clang/AST/Redeclarable.h

clang/include/clang/StaticAnalyzer/Core/PathSensitive/MemRegion.h

clang/lib/AST/ASTImporter.cpp

clang/lib/AST/ExprConstant.cpp

clang/lib/AST/Interp/ByteCodeExprGen.cpp

clang/lib/Analysis/CFG.cpp

clang/lib/Basic/SourceManager.cpp

clang/lib/Basic/Targets/NVPTX.cpp

clang/lib/CodeGen/CGHLSLRuntime.cpp

clang/lib/CodeGen/CodeGenModule.cpp

clang/lib/Driver/Multilib.cpp

clang/lib/Driver/ToolChains/Flang.cpp

clang/lib/Format/Format.cpp

clang/lib/Frontend/Rewrite/RewriteModernObjC.cpp

clang/lib/Frontend/Rewrite/RewriteObjC.cpp

clang/lib/Frontend/SARIFDiagnostic.cpp

clang/lib/Lex/PPDirectives.cpp

clang/lib/Lex/PreprocessingRecord.cpp

clang/lib/Parse/ParseDecl.cpp

clang/lib/Sema/SemaChecking.cpp

clang/lib/Sema/SemaCodeComplete.cpp

clang/lib/Serialization/ASTReader.cpp

clang/lib/Serialization/ASTWriter.cpp

clang/lib/StaticAnalyzer/Core/CoreEngine.cpp

clang/lib/StaticAnalyzer/Core/SVals.cpp

clang/tools/clang-refactor/TestSupport.cpp

clang/tools/libclang/CIndex.cpp

clang/tools/libclang/CXCursor.cpp

clang/utils/TableGen/ClangSyntaxEmitter.cpp

[clang] replace `assert(0)` with `llvm_unreachable` NFC
Needs ReviewPublic