Add immutable WASM global __tls_align which stores the alignment
requirements of the TLS segment.
Add __builtin_wasm_tls_align() intrinsic to get this alignment in Clang.
The expected usage has now changed to:
__wasm_init_tls(memalign(__builtin_wasm_tls_align(), __builtin_wasm_tls_size()));
__tls_align