[CUDA] Enable fusing FP ops (-ffp-contract=fast) for CUDA by default.

Description

This matches the default nvcc behavior and gives a substantial
performance boost on GPUs, where a fused multiply-add (fmad) is much
cheaper than a separate add+mul.
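
As a rough illustration of what contraction changes numerically, here is a minimal host-side C++ sketch. It is not code from this commit, and the helper names `unfused_mul_add` and `fused_mul_add` are hypothetical. It contrasts a multiply followed by an add (two roundings) with a single fused multiply-add (one rounding), which is the transformation that -ffp-contract=fast allows the compiler to apply to expressions like `a * b + c`.

```cpp
#include <cmath>

// Hypothetical helper: multiply, round to float, then add (two roundings).
// The volatile temporary keeps the compiler from contracting the two
// operations into one fused instruction, whatever -ffp-contract is set to.
float unfused_mul_add(float a, float b, float c) {
    volatile float product = a * b;  // rounded to float here
    return product + c;              // rounded a second time here
}

// Hypothetical helper: one fused multiply-add. std::fma computes a * b + c
// as if to infinite precision and rounds only once.
float fused_mul_add(float a, float b, float c) {
    return std::fma(a, b, c);
}
```

With `a = b = 1 + 2^-12` and `c = -(1 + 2^-11)`, the exact product is `1 + 2^-11 + 2^-24`. The unfused version rounds the product to `1 + 2^-11` and returns 0, while the fused version keeps the low bit and returns `2^-24`. Code that depends on the unfused result can opt out with -ffp-contract=off.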

Differential Revision: http://reviews.llvm.org/D20341

Details

Committed
tra, May 19 2016, 11:44 AM
Differential Revision
D20341: [CUDA] Enable fusing FP ops for CUDA by default.
Branches
Unknown
Tags
Unknown