AMDGPU code built with AddressSanitizer enabled generates a very large number of stack
objects (> 200000) and takes forever to compile because of the time spent on slot sharing.
I made up the threshold value and am not sure what it should be (apparently more than 42).
On the other hand, the more stack objects you have, the more useful sharing is.