At the moment we codegen or interpret the IR in exactly the way as it is generated by Clang.
This patch runs the default O2 optimizations over the generated LLVM module before we
interpret or JIT it.
The motivation for this patch is twofold. First, LLDB contains an LLVM IR interpreter that
supports a very limited subset of LLVM instructions for the sake of maintainability. The
IR interpreter is intended as a fast first path to run the code provided by the user (fast in
comparison to JITing and injecting the code). It also is the only way LLDB can evaluate
expressions for targets where we cant JIT&inject code. Because the LLVM IR interpreter
only supports a limited set of instructions, most non-trivial Clang expression can't be
interpreted which is really limiting the expression evaluator functionality when we can't
JIT code. By running running the default -O2 passes over the IR before interpreting it,
LLVM will give us usually a much more concise IR that the IR interpreter can more often
than not handle.
Second, sometimes users want to run expressions that are simple but might process quite
a bit of data. An example for that would be any algorithm that users can now invoke via
the std C++ module. Calling std::count_if with a relatively simple predicate on a large
vector currently takes far too long to be considered usable.
To give some stats on the first claim:
- Currently we run about 7700 expressions in a test suit run (the actual number of expressions is fluctuating a bit....)
- Around 5800 of these expressions are simple enough that the IR interpreter can interpret them right now (most of them just print simple variables)
- This leaves us with around 1900 expressions that we currently can't interpret in the test suite.
- After applying this patch the number of expressions we can't interpret drops to around 1300 (-33%).
- The average expression evaluation time when running on my local machine doesn't change with this patch (around 50ms before and after).
The patch consists of two parts:
- The generic (new) pass manager setup and run code.
- Some new code that injects a function call at the end of our expressions.
The injected function call just exists to prevent the optimizer from removing the store to our
result variable. Right now we capture l-values by having a pointer that points to alloca'd memory.
The optimizer thinks this is a dead store (which it actually is but we read the memory after the
call has finished) and removes it. By injecting a call to an undefined function that just takes the
address of the result we can prevent any of these optimizations. The call is alter on removed
after the pass manager is done so that we can actually interpret/JIT the module again.
Why is it safe for lldb to take the address of "c"? After evaluation completes, is the state of the stack preserved?