Currently, the default alignment is much larger than the actual size of the vector in memory. Fix this to use a sane default.
For SVE, temporarily remove lowering of load/store operations for predicates with less than 16 elements. The layout the backend was assuming for SVE predicates with less than 16 elements doesn't agree with the datalayout. More work probably needs to be done here.
This change is, strictly speaking, not backwards-compatible at the bitcode level. But probably nobody is actually depending on that; i1 vectors in memory are rare, and the code that does use them probably ends up forcing the alignment to something sane anyway. If we think this is a concern, I can restrict this to scalable vectors for now (where it's actually causing issues for me at the moment).
I did my best to update the regression tests, but I'm not completely sure I did it correctly for amdgpu and nvptx.
Is there any way to set the alignment of fixed vector type to 1 byte rather than a power of 2 by default?