These do not exist in ptxas, it refuses to compile them.
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM
- Build Status
Buildable 21728 Build 21728: arc lint + arc unit
Event Timeline
Comment Actions
This is a surprise. PTX ISA does not mention that .ftz is not applicable to cvt.*.f16.* instructions.
Is it only cvt that does not support .ftz or does it impact other instructions? PTX spec has add/sub/mul/fma/set/setp instructions that support f16 and have .ftz variant.
Comment Actions
It's only cvt with an explicit rounding mode. I actually ran the output of f16-instructions.ll with FTZ through ptxas and removed instructions until it compiled it. This might even be a bug in ptxas.
Comment Actions
It may be worth filing a bug with NVIDIA to either fix the problem or clarify the docs.