This patch adds support for the
__kmpc_get_hardware_num_threads_in_block function that returns the
number of threads. This was missing in the new runtime and was used by
the AMDGPU plugin which prevented it from using the new runtime. This
patchs also unified the interface for getting the thread numbers in the
Originally authored by jdoerfert.