This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/lib/Target/NVPTX/
-
lib/
-
Target/
-
NVPTX/
3/6
NVPTXTargetTransformInfo.cpp

Differential D139481

NVPTX: Cleanup check for denormal mode
ClosedPublic

Authored by arsenm on Dec 6 2022, 3:56 PM.

Download Raw Diff

Details

Reviewers

tra
bkramer

Summary

Go through the common query and be explicit about the supported flush
type.

Diff Detail

Event Timeline

arsenm created this revision.Dec 6 2022, 3:56 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 6 2022, 3:56 PM

Herald added subscribers: mattd, gchakrabarti, asavonic, hiraditya. · View Herald Transcript

arsenm requested review of this revision.Dec 6 2022, 3:56 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 6 2022, 3:56 PM

Herald added subscribers: wdng, jholewinski. · View Herald Transcript

arsenm added inline comments.Dec 6 2022, 3:57 PM

llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp
374	Looks broken for double to me but I don't know what the options are

tra accepted this revision.Dec 6 2022, 3:58 PM

This revision is now accepted and ready to land.Dec 6 2022, 3:58 PM

tra added inline comments.Dec 6 2022, 4:16 PM

llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp
149–151	Huh. I wonder why are we using isHalfTy for figuring out denormal mode. Considering that it seems to be FP32 that is the special case with an attribute of its own, it should've been `IsSingleTy`. Knowing whether it's fp16 or not only allows us to get correct mode for fp16, but not for the other types.
374	Indeed. I do not understand why getDenormalMode appears to single out fp32 as a special case.

arsenm added inline comments.Dec 6 2022, 4:17 PM

llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp
374	Because turn on f32 flush is the option the languages actually expose. denormal-fp-math-f32 acts as an override to denormal-fp-math

tra added inline comments.Dec 6 2022, 4:54 PM

llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp
374	OK, then the code is (and has been) broken for 'double'. I do not think we want to override denormals mode for doubles using `denormal-fp-math-f32`. On the other hand, it's no more broken than it was before, so we can probably live with that until we figure out how to fix this. https://llvm.org/docs/LangRef.html says: " denormal-fp-math-f32" [...] Not all targets support separately setting the denormal mode per type, and no attempt is made to diagnose unsupported uses. Currently this attribute is respected by the AMDGPU and NVPTX backends. On a side note, I wonder if the denormals handling mode may need to grow a similar override for fp16, which also has .ftz instruction variants.

arsenm added inline comments.Dec 6 2022, 5:05 PM

llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp
374	But do those imply a change in the default FP mode, or are they just operations that have flushing semantics in the default mode

Harbormaster completed remote builds in B201524: Diff 480664.Dec 7 2022, 12:33 AM

90f60a6a737b397c49c56371f628e4b6440c00fd

Revision Contents

Path

Size

llvm/

lib/

Target/

NVPTX/

NVPTXTargetTransformInfo.cpp

9 lines

Diff 480664

llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp

Show First 20 Lines • Show All 140 Lines • ▼ Show 20 Lines	static Instruction simplifyNvvmIntrinsic(IntrinsicInst II, InstCombiner &IC) {
struct SimplifyAction {		struct SimplifyAction {
// Invariant: At most one of these Optionals has a value.		// Invariant: At most one of these Optionals has a value.
std::optional<Intrinsic::ID> IID;		std::optional<Intrinsic::ID> IID;
std::optional<Instruction::CastOps> CastOp;		std::optional<Instruction::CastOps> CastOp;
std::optional<Instruction::BinaryOps> BinaryOp;		std::optional<Instruction::BinaryOps> BinaryOp;
std::optional<SpecialCase> Special;		std::optional<SpecialCase> Special;

FtzRequirementTy FtzRequirement = FTZ_Any;		FtzRequirementTy FtzRequirement = FTZ_Any;
// Denormal handling is guarded by different attributes depending on the		// Denormal handling is guarded by different attributes depending on the
// type (denormal-fp-math vs denormal-fp-math-f32), take note of halfs.		// type (denormal-fp-math vs denormal-fp-math-f32), take note of halfs.
bool IsHalfTy = false;		bool IsHalfTy = false;
		traUnsubmitted Not Done Reply Inline Actions Huh. I wonder why are we using isHalfTy for figuring out denormal mode. Considering that it seems to be FP32 that is the special case with an attribute of its own, it should've been `IsSingleTy`. Knowing whether it's fp16 or not only allows us to get correct mode for fp16, but not for the other types. tra: Huh. I wonder why are we using isHalfTy for figuring out denormal mode. Considering that it…

SimplifyAction() = default;		SimplifyAction() = default;

SimplifyAction(Intrinsic::ID IID, FtzRequirementTy FtzReq,		SimplifyAction(Intrinsic::ID IID, FtzRequirementTy FtzReq,
bool IsHalfTy = false)		bool IsHalfTy = false)
: IID(IID), FtzRequirement(FtzReq), IsHalfTy(IsHalfTy) {}		: IID(IID), FtzRequirement(FtzReq), IsHalfTy(IsHalfTy) {}

// Cast operations don't have anything to do with FTZ, so we skip that		// Cast operations don't have anything to do with FTZ, so we skip that
▲ Show 20 Lines • Show All 204 Lines • ▼ Show 20 Lines	const SimplifyAction Action = [II]() -> SimplifyAction {
}		}
}();		}();

// If Action.FtzRequirementTy is not satisfied by the module's ftz state, we		// If Action.FtzRequirementTy is not satisfied by the module's ftz state, we
// can bail out now. (Notice that in the case that IID is not an NVVM		// can bail out now. (Notice that in the case that IID is not an NVVM
// intrinsic, we don't have to look up any module metadata, as		// intrinsic, we don't have to look up any module metadata, as
// FtzRequirementTy will be FTZ_Any.)		// FtzRequirementTy will be FTZ_Any.)
if (Action.FtzRequirement != FTZ_Any) {		if (Action.FtzRequirement != FTZ_Any) {
const char *AttrName =		DenormalMode Mode = II->getFunction()->getDenormalMode(
Action.IsHalfTy ? "denormal-fp-math" : "denormal-fp-math-f32";		Action.IsHalfTy ? APFloat::IEEEhalf() : APFloat::IEEEsingle());
StringRef Attr =		bool FtzEnabled = Mode.Output == DenormalMode::PreserveSign;
		arsenmAuthorUnsubmitted Done Reply Inline Actions Looks broken for double to me but I don't know what the options are arsenm: Looks broken for double to me but I don't know what the options are
		traUnsubmitted Not Done Reply Inline Actions Indeed. I do not understand why getDenormalMode appears to single out fp32 as a special case. tra: Indeed. I do not understand why getDenormalMode appears to single out fp32 as a special case.
		arsenmAuthorUnsubmitted Done Reply Inline Actions Because turn on f32 flush is the option the languages actually expose. denormal-fp-math-f32 acts as an override to denormal-fp-math arsenm: Because turn on f32 flush is the option the languages actually expose. denormal-fp-math-f32…
		traUnsubmitted Not Done Reply Inline Actions OK, then the code is (and has been) broken for 'double'. I do not think we want to override denormals mode for doubles using `denormal-fp-math-f32`. On the other hand, it's no more broken than it was before, so we can probably live with that until we figure out how to fix this. https://llvm.org/docs/LangRef.html says: " denormal-fp-math-f32" [...] Not all targets support separately setting the denormal mode per type, and no attempt is made to diagnose unsupported uses. Currently this attribute is respected by the AMDGPU and NVPTX backends. On a side note, I wonder if the denormals handling mode may need to grow a similar override for fp16, which also has .ftz instruction variants. tra: OK, then the code is (and has been) broken for 'double'. I do not think we want to override…
		arsenmAuthorUnsubmitted Done Reply Inline Actions But do those imply a change in the default FP mode, or are they just operations that have flushing semantics in the default mode arsenm: But do those imply a change in the default FP mode, or are they just operations that have…
II->getFunction()->getFnAttribute(AttrName).getValueAsString();
DenormalMode Mode = parseDenormalFPAttribute(Attr);
bool FtzEnabled = Mode.Output != DenormalMode::IEEE;

if (FtzEnabled != (Action.FtzRequirement == FTZ_MustBeOn))		if (FtzEnabled != (Action.FtzRequirement == FTZ_MustBeOn))
return nullptr;		return nullptr;
}		}

// Simplify to target-generic intrinsic.		// Simplify to target-generic intrinsic.
if (Action.IID) {		if (Action.IID) {
SmallVector<Value *, 4> Args(II->args());		SmallVector<Value *, 4> Args(II->args());
▲ Show 20 Lines • Show All 86 Lines • Show Last 20 Lines