This fixes a 5% performance regression in CPython for some inputs by fixing codegen for any_find_slice in unicodeobject.c
https://github.com/python/cpython/blob/main/Objects/unicodeobject.c#L8748
When enabling -aarch64-enable-gep-opt the transform splits GEP instruction generating unecessary instructions.
Should this be loading from nullptr?