This returns the number of physical processors that can run a team on the specified device. For AMD, this is the number of CUs. for Nvidia, this is number of SMs. For CPUs, this COULD be number of sockets if multiple teams are supported. This API is needed for optimizing cross team reductions where we want to minimize the number of intermediate per-team reduction values.
If the device id is the initial device, 1 is returned.
It's unclear why we need to store this in two places, the plugins and here. Other device data only lives in the plugins, this should too.