This optimization merges adjacent zero stores into a wider store.
e.g.,
strh wzr, [x0] strh wzr, [x0, #2] ; becomes str wzr, [x0]
e.g.,
str wzr, [x0] str wzr, [x0, #4] ; becomes str xzr, [x0]
Previously, this was only enabled for Kryo and Cortex-A57. I'd like to enable it for all subtargets.
Chad
I don't see the benefits of testing this for every CPU in the AArch64 target. Maybe 1 single RUN: is enough here?