[D56411] Temp solution fixing CUDA template issue
- template with overloadable kernel function as the template function need revising CheckCUDACall checking.
[SelectionDAG] Harden the checking of RegClass when adding operand
- If the operand index is out-of-range, expect nullptr is returned.
[AMDGPU] Allow using integral non-type template parameters
- Allow using integral non-type template parameters in the following attributes
attribute((amdgpu_flat_work_group_size(<min>, <max>))) attribute((amdgpu_waves_per_eu(<min>[, <max>])))