Previously if you did e.g.
$ clang -march=haswell -x cuda foo.cu
we would pass "-march=haswell -march=sm_20" down to the ptxas tool.
This causes it to assert, and rightly so!
Differential D21419
[CUDA] Don't pass top-level -march down to device cc1 or ptxas. jlebar on Jun 15 2016, 4:23 PM. Authored by
Details Previously if you did e.g. $ clang -march=haswell -x cuda foo.cu we would pass "-march=haswell -march=sm_20" down to the ptxas tool.
Diff Detail Event Timeline
|
These look redundant -- we only care whether we eliminate -march on device side. It does not depend on the value of -march or on particular GPU arch. What do I miss?