Call slot optimization needs to know that writing to the destination of the
memcpy won't trap, so it has to know the number of dereferenceable bytes of the
destination. Currently it can only handle this for allocas and function
arguments that are directly written to. We can improve upon this by handling
simple GEPs with constant offsets.
This allows call slot optimizations to happen in code like this:
struct Foo {
int o[200];
};
struct Bar {
char p; Foo f;
};
__attribute__((noinline))
Foo moo()
{
return {0};
}
Bar goo()
{
Bar f; f.f = moo(); return f;
}
Which is a slightly modified version of the code in PR35134 that works around
the missing alias information in the original C++ code.
It also makes the optimization happen in the Rust code that originally lead to
the above PR.
arraySize -> ArraySize