Instead of hardcoding individual GPU mappings in multiple functions, keep them
all in one table and use it to look up the mappings.
We also don't care about 'virtual' architecture much, so the API is trimmed down
down to a simpler GPU->Virtual arch name lookup.
Thanks for the efforts. Really appreciate this.
Can we separate this part to a def file and include it here and also in enum class CudaArch? Then we only need to maintain the def file.
Thanks.