Just running -fsyntax-only over arm_neon.h doesn't cover some intrinsics which are defined using macros. Add more test coverage for that.
arm-neon-header.c wasn't checking the full set of available NEON target features; change the target architecture to account for that.
We were not generating necessary casts in some cases; add test coverage for that.
Fix VFMLAL_LOW etc. to express their signatures differently, so the builtins have the expected type. Maybe the TableGen backend should detect this and produce an error. The rules here are sort of strange.