This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/CodeGen/AsmPrinter/
-
CodeGen/
-
AsmPrinter/
2/2
AsmPrinter.cpp
-
test/CodeGen/PowerPC/
-
CodeGen/
-
PowerPC/
1/2
instruction-mix-remarks-BCTRL_LWZinto_toc.ll

Differential D113173

[AsmPrinter][ORE] use correct opcode name
ClosedPublic

Authored by shchenz on Nov 4 2021, 1:10 AM.

Download Raw Diff

Details

Reviewers

jsji
fhahn
arsenm
paquette

Group Reviewers

Restricted Project

Commits

rG50acbbe3cd19: [AsmPrinter][ORE] use correct opcode name

Summary

This is found on AIX when using opt-viewer.py.

On AIX, we have a pseudo instruction:

def BCTRL_LWZinto_toc:
  XLForm_2_ext_and_DForm_1<19, 528, 20, 0, 1, 32, (outs),
   (ins memri:$src), "bctrl\n\tlwz 2, $src", IIC_BrB,
   [(PPCbctrl_load_toc iaddr:$src)]>, Requires<[In32BitMode]>;

This pseudo instruction consists of two instructions: bctrl without any explicit operands and lwz with two operands 2 and $src.

getMnemonic introduced in D90040 returns a very strange opcode name for the above instruction, it is bctrl\n\tlwz 2, because AsmWriterInst::AsmWriterInst() treats $ as separator between opcode and operands.

So in the output YAML when generating remarks for asm-printer pass, we get:

- String:          "\n"
- String:          "bctrl\n\tlwz 2, "
- String:          ': '
- INST_bctrl
      lwz 2,: '1' 
- String:          "\n"

The invalid YAML will cause opt-viewer.py to work abnormally.

Seems AsmWriterInst::AsmWriterInst() treating $ as the only separator for opcode name and operands is right, because \n or \t are both treated as a normal character in opcode name or operands name, and \n\t exists very common in instruction mnemonic on some targets. so we can not fix the issue there.

And using bctrl as the opcode name for the pseudo instruction BCTRL_LWZinto_toc also follows the current design, the base class for the instruction form indicates:

// Two joined instructions; used to emit two adjacent instructions as one.
// The itinerary from the first instruction is used for scheduling and
// classification.
class I2 {
}

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

shchenz created this revision.Nov 4 2021, 1:10 AM

Herald added subscribers: hiraditya, nemanjai. · View Herald TranscriptNov 4 2021, 1:10 AM

shchenz requested review of this revision.Nov 4 2021, 1:10 AM

Herald added a project: Restricted Project. · View Herald TranscriptNov 4 2021, 1:10 AM

Herald added subscribers: llvm-commits, wdng. · View Herald Transcript

Harbormaster completed remote builds in B132400: Diff 384674.Nov 4 2021, 1:11 AM

jsji added inline comments.Nov 4 2021, 12:02 PM

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
1418	Can we just use getToken()? - auto Name = (Twine("INST_") + KV.first.trim()).str(); + auto Name = (Twine("INST_") + getToken(KV.first).first.trim()).str();

address comment

shchenz marked an inline comment as done.Nov 4 2021, 8:52 PM

shchenz added inline comments.

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
1418	Yes, `getToken()` is simpiler

Harbormaster completed remote builds in B132597: Diff 384948.Nov 4 2021, 8:52 PM

delete the unnecessary default paramater

Harbormaster completed remote builds in B132599: Diff 384950.Nov 4 2021, 8:57 PM

jsji accepted this revision as: jsji.Nov 5 2021, 6:38 AM

This revision is now accepted and ready to land.Nov 5 2021, 6:38 AM

shchenz mentioned this in rGc7d27f90e7c8: [ORE][AsmPrinter] add testcase for D113173; NFC.Nov 7 2021, 5:51 PM

This revision was landed with ongoing or failed builds.Nov 7 2021, 5:55 PM

Closed by commit rG50acbbe3cd19: [AsmPrinter][ORE] use correct opcode name (authored by shchenz). · Explain Why

This revision was automatically updated to reflect the committed changes.

shchenz added a commit: rG50acbbe3cd19: [AsmPrinter][ORE] use correct opcode name.

fhahn added inline comments.Nov 8 2021, 6:44 AM

llvm/test/CodeGen/PowerPC/instruction-mix-remarks-BCTRL_LWZinto_toc.ll
9	shouldn't this be 2 instructions and the count for `tld` is missing at the moment? I am not very familiar with PPC instructions, but the description contains: This pseudo instruction consists of two instructions: bctrl without any explicit operands and lwz with two operands 2 and $src.

shchenz added inline comments.Nov 8 2021, 4:32 PM

llvm/test/CodeGen/PowerPC/instruction-mix-remarks-BCTRL_LWZinto_toc.ll
9	This is a known limitation for this kind of pseudo instructions for scheduling/classification, and now one more, for opcode mnemonic if we want to get the name from `getMnemonic` // Two joined instructions; used to emit two adjacent instructions as one. // The itinerary from the first instruction is used for scheduling and // classification. class I2 { }

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

AsmPrinter/

AsmPrinter.cpp

2 lines

test/

CodeGen/

PowerPC/

instruction-mix-remarks-BCTRL_LWZinto_toc.ll

3 lines

Diff 385384

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

Show First 20 Lines • Show All 1,409 Lines • ▼ Show 20 Lines	if (CanDoExtraAnalysis) {
if (A.second > B.second)		if (A.second > B.second)
return true;		return true;
if (A.second == B.second)		if (A.second == B.second)
return StringRef(A.first) < StringRef(B.first);		return StringRef(A.first) < StringRef(B.first);
return false;		return false;
});		});
R << "BasicBlock: " << ore::NV("BasicBlock", MBB.getName()) << "\n";		R << "BasicBlock: " << ore::NV("BasicBlock", MBB.getName()) << "\n";
for (auto &KV : MnemonicVec) {		for (auto &KV : MnemonicVec) {
auto Name = (Twine("INST_") + KV.first.trim()).str();		auto Name = (Twine("INST_") + getToken(KV.first.trim()).first).str();
		jsjiUnsubmitted Done Reply Inline Actions Can we just use getToken()? - auto Name = (Twine("INST_") + KV.first.trim()).str(); + auto Name = (Twine("INST_") + getToken(KV.first).first.trim()).str(); jsji: Can we just use getToken()? ``` - auto Name = (Twine("INST_") + KV.first.trim()).str()…
		shchenzAuthorUnsubmitted Done Reply Inline Actions Yes, `getToken()` is simpiler shchenz: Yes, `getToken()` is simpiler
R << KV.first << ": " << ore::NV(Name, KV.second) << "\n";		R << KV.first << ": " << ore::NV(Name, KV.second) << "\n";
}		}
ORE->emit(R);		ORE->emit(R);
}		}
}		}

EmittedInsts += NumInstsInFunction;		EmittedInsts += NumInstsInFunction;
MachineOptimizationRemarkAnalysis R(DEBUG_TYPE, "InstructionCount",		MachineOptimizationRemarkAnalysis R(DEBUG_TYPE, "InstructionCount",
▲ Show 20 Lines • Show All 2,212 Lines • Show Last 20 Lines

llvm/test/CodeGen/PowerPC/instruction-mix-remarks-BCTRL_LWZinto_toc.ll


	; RUN: llc -verify-machineinstrs -mattr=-altivec -mtriple powerpc64-ibm-aix-xcoff \			; RUN: llc -verify-machineinstrs -mattr=-altivec -mtriple powerpc64-ibm-aix-xcoff \
	; RUN: -pass-remarks-output=%t -pass-remarks=asm-printer -mcpu=pwr4 -o - %s			; RUN: -pass-remarks-output=%t -pass-remarks=asm-printer -mcpu=pwr4 -o - %s
	; RUN: FileCheck --input-file=%t %s			; RUN: FileCheck --input-file=%t %s

	; CHECK: - String: "\n"			; CHECK: - String: "\n"
	; CHECK: - String: "bctrl\n\tld 2, "			; CHECK: - String: "bctrl\n\tld 2, "
	; CHECK: - String: ': '			; CHECK: - String: ': '
	; CHECK: - INST_bctrl			; CHECK: - INST_bctrl: '1'
				fhahnUnsubmitted Not Done Reply Inline Actions shouldn't this be 2 instructions and the count for `tld` is missing at the moment? I am not very familiar with PPC instructions, but the description contains: This pseudo instruction consists of two instructions: bctrl without any explicit operands and lwz with two operands 2 and $src. fhahn: shouldn't this be 2 instructions and the count for `tld` is missing at the moment? I am not…
				shchenzAuthorUnsubmitted Done Reply Inline Actions This is a known limitation for this kind of pseudo instructions for scheduling/classification, and now one more, for opcode mnemonic if we want to get the name from `getMnemonic` // Two joined instructions; used to emit two adjacent instructions as one. // The itinerary from the first instruction is used for scheduling and // classification. class I2 { } shchenz: This is a known limitation for this kind of pseudo instructions for scheduling/classification…
	; CHECK: ld 2,: '1'
	; CHECK: - String: "\n"			; CHECK: - String: "\n"


	define void @callThroughPtrWithArgs(void (i32, i16, i64)* nocapture) {			define void @callThroughPtrWithArgs(void (i32, i16, i64)* nocapture) {
	tail call void %0(i32 signext 1, i16 zeroext 2, i64 3)			tail call void %0(i32 signext 1, i16 zeroext 2, i64 3)
	ret void			ret void
	}			}