This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Avoid folding 2 constant operands into an SALU operation
Closed · Public

Authored by dstuttard on Dec 2 2019, 5:06 AM.


Details

Reviewers

arsenm
foad

Commits

rG46db60683422: AMDGPU: Avoid folding 2 constant operands into an SALU operation

Summary

Catch the (admittedly unusual) case where SIFoldOperands attempts to fold 2
constant operands into the same SALU operation, with neither operand able to be
encoded as an inline constant.
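
A minimal sketch of the encoding constraint behind this change, assuming only 32-bit integer operands. The `Operand` struct and the helper names are hypothetical and are not LLVM APIs; floating-point inline constants are deliberately left out. It models the rule that a scalar instruction can carry at most one operand that needs the 32-bit literal slot.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical, simplified operand model: either a register or a 32-bit
// integer immediate. Real MachineOperands have many more kinds.
struct Operand {
  bool IsImm;
  int32_t Imm; // only meaningful when IsImm is true
};

// Integer inline constants cover -16..64 inclusive (assumption: the
// floating-point inline constants are ignored in this sketch).
inline bool isInlineInt(int32_t V) { return V >= -16 && V <= 64; }

// Count the operands that would each need the instruction's literal slot.
inline unsigned countLiterals(const std::vector<Operand> &Ops) {
  unsigned N = 0;
  for (const Operand &Op : Ops)
    if (Op.IsImm && !isInlineInt(Op.Imm))
      ++N;
  return N;
}

// The encoding has room for at most one 32-bit literal.
inline bool isEncodable(const std::vector<Operand> &Ops) {
  return countLiterals(Ops) <= 1;
}
```

Under this model, operands `70, 80` are not encodable together, while `70, 63` are, because 63 falls in the inline range.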

Change-Id: Ibc48d662c9ffd8bbacd154976b0b1c257ace0927

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dstuttard created this revision. Dec 2 2019, 5:06 AM

Herald added a project: Restricted Project. Dec 2 2019, 5:06 AM

Herald added subscribers: llvm-commits, hiraditya, t-tye and 7 others.

Harbormaster completed remote builds in B41703: Diff 231680. Dec 2 2019, 5:06 AM

Formatting

Harbormaster completed remote builds in B41704: Diff 231681. Dec 2 2019, 5:07 AM

I would have preferred to put this check into isImmOperandLegal in SIInstrInfo.cpp, but that produced lots of lit regressions. It looks like commute operations use this function even when they are swapping an operand rather than replacing it, which breaks.

I've never completely liked the way isOperandLegal is structured to always check the legality with respect to all other operands.

llvm/lib/Target/AMDGPU/SIFoldOperands.cpp
437	TRI is already a class member
440	Apply De Morgan's law to this, and I would order TRI.opCanUseInlineConstant(OpInfo.OperandType) before isInlineConstant
445–446	This isn't the right check for a non-inline constant. It won't catch frame index for example.
llvm/test/CodeGen/AMDGPU/fold-sgpr-multi-imm.mir
13	Is this just hiding not handling every possible case in tryConstantFoldOp? Can you add a test with a more exotic instruction we're unlikely to ever try to constant fold here? Another case that should always break is a frame index operand
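
To illustrate the frame index objection above, here is a small sketch contrasting a check that only looks at plain immediates with a broader "literal-like" check. The `Kind` enum, the `Operand` struct and both predicates are hypothetical stand-ins written for this example; they only approximate the idea behind the `isLiteralConstantLike` call that appears in the final diff, and the integer-only inline range is a simplifying assumption.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical operand kinds for illustration; not the MachineOperand API.
enum class Kind { Reg, Imm, FrameIndex };

struct Operand {
  Kind K;
  int64_t Val; // immediate value for Kind::Imm, otherwise an id
};

// Assumption: only integer inline constants (-16..64) are modelled.
inline bool isInlineInt(int64_t V) { return V >= -16 && V <= 64; }

// Narrow check: treats only non-inline immediates as literals, so a frame
// index slips through.
inline bool isNonInlineImm(const Operand &Op) {
  return Op.K == Kind::Imm && !isInlineInt(Op.Val);
}

// Broader check: anything that is neither a register nor an inline
// immediate is assumed to end up occupying the literal slot.
inline bool isLiteralLike(const Operand &Op) {
  switch (Op.K) {
  case Kind::Reg:
    return false;
  case Kind::Imm:
    return !isInlineInt(Op.Val);
  case Kind::FrameIndex:
    return true;
  }
  return false;
}
```

With a frame index operand, `isNonInlineImm` returns false while `isLiteralLike` returns true, which is the gap the review comment points at.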

Made suggested changes

llvm/lib/Target/AMDGPU/SIFoldOperands.cpp
437	I couldn't work out if you were objecting to the name "TRI" or if you wanted me to re-use TRI from the class object. I decided to rename it to SRI, since re-using TRI requires a more pervasive change and an extra cast as well.
llvm/test/CodeGen/AMDGPU/fold-sgpr-multi-imm.mir
13	I wasn't sure if the instruction I've chosen is now obscure enough - give me a different suggestion if you can think of something better. (Also added the frameindex case).

Harbormaster completed remote builds in B41778: Diff 231860. Dec 3 2019, 3:27 AM

arsenm added inline comments. Dec 3 2019, 9:00 AM

llvm/lib/Target/AMDGPU/SIFoldOperands.cpp
437	I mean re-use. I'm surprised a non-target reference to TRI exists anywhere in this pass

arsenm accepted this revision. Dec 3 2019, 9:04 AM

This revision is now accepted and ready to land. Dec 3 2019, 9:04 AM

Closed by commit rG46db60683422: AMDGPU: Avoid folding 2 constant operands into an SALU operation (authored by dstuttard). Dec 4 2019, 2:44 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path                                                Size
llvm/lib/Target/AMDGPU/SIFoldOperands.cpp           23 lines
llvm/test/CodeGen/AMDGPU/fold-sgpr-multi-imm.mir    71 lines

Diff 232067

llvm/lib/Target/AMDGPU/SIFoldOperands.cpp

    if (!TII->isOperandLegal(*MI, CommuteOpNo, OpToFold)) {
      TII->commuteInstruction(*MI, false, CommuteIdx0, CommuteIdx1);
      return false;
    }

    appendFoldCandidate(FoldList, MI, CommuteOpNo, OpToFold, true);
    return true;
  }

  // Check the case where we might introduce a second constant operand to a
  // scalar instruction
  if (TII->isSALU(MI->getOpcode())) {
    const MCInstrDesc &InstDesc = MI->getDesc();
    const MCOperandInfo &OpInfo = InstDesc.OpInfo[OpNo];
    const SIRegisterInfo &SRI = TII->getRegisterInfo();

    // Fine if the operand can be encoded as an inline constant
    if (OpToFold->isImm()) {
      if (!SRI.opCanUseInlineConstant(OpInfo.OperandType) ||
          !TII->isInlineConstant(*OpToFold, OpInfo)) {
        // Otherwise check for another constant
        for (unsigned i = 0, e = InstDesc.getNumOperands(); i != e; ++i) {
          auto &Op = MI->getOperand(i);
          if (OpNo != i &&
              TII->isLiteralConstantLike(Op, OpInfo)) {
            return false;
          }
        }
      }
    }
  }

  appendFoldCandidate(FoldList, MI, OpNo, OpToFold);
  return true;
}

// If the use operand doesn't care about the value, this may be an operand only
// used for register indexing, in which case it is unsafe to fold.
static bool isUseSafeToFold(const SIInstrInfo *TII,
                            const MachineInstr &MI,

llvm/test/CodeGen/AMDGPU/fold-sgpr-multi-imm.mir

This file was added.

# RUN: llc -march=amdgcn -verify-machineinstrs -run-pass si-fold-operands %s -o - | FileCheck -check-prefix=GCN %s

# GCN-LABEL: name: test_part_fold{{$}}
# GCN: %2:sreg_32 = S_ADD_I32 70, %1
---
name: test_part_fold
tracksRegLiveness: true
body: |
  bb.0:
    %0:sreg_32 = S_MOV_B32 70
    %1:sreg_32 = S_MOV_B32 80
    %2:sreg_32 = S_ADD_I32 %0, %1, implicit-def $scc
...

# GCN-LABEL: name: test_inline_const{{$}}
# GCN: %2:sreg_32 = S_ADD_I32 70, 63
---
name: test_inline_const
tracksRegLiveness: true
body: |
  bb.0:
    %0:sreg_32 = S_MOV_B32 70
    %1:sreg_32 = S_MOV_B32 63
    %2:sreg_32 = S_ADD_I32 %0, %1, implicit-def $scc
...
# GCN-LABEL: name: test_obscure{{$}}
# GCN: %2:sreg_32 = S_LSHL2_ADD_U32 70, %1
---
name: test_obscure
tracksRegLiveness: true
body: |
  bb.0:
    %0:sreg_32 = S_MOV_B32 70
    %1:sreg_32 = S_MOV_B32 80
    %2:sreg_32 = S_LSHL2_ADD_U32 %0, %1, implicit-def $scc
...
# GCN-LABEL: name: test_obscure_inline{{$}}
# GCN: %2:sreg_32 = S_LSHL2_ADD_U32 70, 63
---
name: test_obscure_inline
tracksRegLiveness: true
body: |
  bb.0:
    %0:sreg_32 = S_MOV_B32 70
    %1:sreg_32 = S_MOV_B32 63
    %2:sreg_32 = S_LSHL2_ADD_U32 %0, %1, implicit-def $scc
...
# GCN-LABEL: name: test_frameindex{{$}}
# GCN: %1:sreg_32 = S_ADD_I32 %stack.0, %0
---
name: test_frameindex
tracksRegLiveness: true
stack:
  - { id: 0, type: default, offset: 0, size: 64, alignment: 16}
body: |
  bb.0:
    %0:sreg_32 = S_MOV_B32 70
    %1:sreg_32 = S_ADD_I32 %stack.0, %0, implicit-def $scc
...
# GCN-LABEL: name: test_frameindex_inline{{$}}
# GCN: %1:sreg_32 = S_ADD_I32 %stack.0, 63
---
name: test_frameindex_inline
tracksRegLiveness: true
stack:
  - { id: 0, type: default, offset: 0, size: 64, alignment: 16}
body: |
  bb.0:
    %0:sreg_32 = S_MOV_B32 63
    %1:sreg_32 = S_ADD_I32 %stack.0, %0, implicit-def $scc
...
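
The expectations in the test file can be reproduced with a small stand-alone model of the fold decision. This is only a sketch under stated assumptions: the `Kind`, `Operand`, `tryFoldImm` and `render` names are hypothetical, only integer inline constants (-16..64) are modelled, and the two `S_MOV_B32` definitions are assumed to be folded in program order. It is not the pass itself, just the shape of the check shown in the SIFoldOperands.cpp diff.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical model, not LLVM's API: a source operand is a virtual
// register, an immediate, or a frame index.
enum class Kind { Reg, Imm, FrameIndex };

struct Operand {
  Kind K;
  int64_t Val; // register number, immediate value, or frame index id
};

inline bool isInlineInt(int64_t V) { return V >= -16 && V <= 64; }

inline bool isLiteralLike(const Operand &Op) {
  if (Op.K == Kind::Reg)
    return false;
  if (Op.K == Kind::Imm)
    return !isInlineInt(Op.Val);
  return true; // frame index
}

// Try to replace source operand OpNo with the immediate Imm. Inline
// constants always fold; otherwise the fold is refused if any other
// operand already needs the literal slot.
inline bool tryFoldImm(std::vector<Operand> &Srcs, std::size_t OpNo,
                       int64_t Imm) {
  if (!isInlineInt(Imm)) {
    for (std::size_t I = 0, E = Srcs.size(); I != E; ++I)
      if (I != OpNo && isLiteralLike(Srcs[I]))
        return false;
  }
  Srcs[OpNo] = Operand{Kind::Imm, Imm};
  return true;
}

// Render the operands in a MIR-like form for comparison with CHECK lines.
inline std::string render(const std::vector<Operand> &Srcs) {
  std::string S;
  for (const Operand &Op : Srcs) {
    if (!S.empty())
      S += ", ";
    if (Op.K == Kind::Reg)
      S += "%" + std::to_string(Op.Val);
    else if (Op.K == Kind::Imm)
      S += std::to_string(Op.Val);
    else
      S += "%stack." + std::to_string(Op.Val);
  }
  return S;
}
```

For example, starting from `{%0, %1}` and folding 70 then 80 leaves `70, %1` in this model (the second fold is refused), which corresponds to test_part_fold; folding 70 next to `%stack.0` is refused outright, which corresponds to test_frameindex.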