This is an archive of the discontinued LLVM Phabricator instance.

I was thinking "would it be better to move this logic down into PPCAsmParser::MatchRegisterName?" Well, I suppose that depends on whether the % prefix on registers is valid in other contexts, for instance in other places that call PPCAsmParser::MatchRegisterName directly rather than PPCAsmParser::tryParseRegister, which you've modified here.

Right away, the next caller of PPCAsmParser::MatchRegisterName I found is in PPCAsmParser::ParseOperand:

  switch (getLexer().getKind()) {
  // Special handling for register names.  These are interpreted
  // as immediates corresponding to the register number.
  case AsmToken::Percent:
    Parser.Lex(); // Eat the '%'.
    unsigned RegNo;
    int64_t IntVal;
    if (MatchRegisterName(RegNo, IntVal))
      return Error(S, "invalid register name");
...
    switch (getLexer().getKind()) {
    case AsmToken::Percent:
      Parser.Lex(); // Eat the '%'.
      unsigned RegNo;
      if (MatchRegisterName(RegNo, IntVal))

So it looks like the callers are currently inconsistent: some lex the % and others do not. I highly doubt that's intentional. In that case, would you mind sinking the lexing of % into PPCAsmParser::MatchRegisterName and removing the existing % lexing, so that all sites that parse registers with an optional % prefix do so in one place? I guess the existing sites expect a % unconditionally; I suspect it should be conditional, at least for portability with GNU as.
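The request above can be illustrated with a minimal, self-contained sketch. It does not use LLVM's real classes: TokKind, Token, TokenCursor, and matchRegisterName are hypothetical stand-ins for the lexer and parser, chosen only to show the '%' being consumed conditionally inside the matcher so that callers no longer eat it themselves.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-ins for the lexer's token kinds; not LLVM's real types.
enum class TokKind { Percent, Identifier, Other, End };

struct Token {
  TokKind Kind;
  std::string Text;
};

// A minimal cursor over pre-lexed tokens, playing the role of the parser.
class TokenCursor {
public:
  explicit TokenCursor(std::vector<Token> Input) : Toks(std::move(Input)) {}

  // Returns the current token, or an End token once the input is exhausted.
  const Token &peek() const {
    static const Token EndTok{TokKind::End, ""};
    return Pos < Toks.size() ? Toks[Pos] : EndTok;
  }

  // Advances to the next token, if there is one.
  void lex() {
    if (Pos < Toks.size())
      ++Pos;
  }

  std::size_t position() const { return Pos; }

private:
  std::vector<Token> Toks;
  std::size_t Pos = 0;
};

// Returns true on failure, mirroring the convention of the code under review.
// The '%' is eaten here, and only if present, so callers need not do it.
bool matchRegisterName(TokenCursor &C, std::string &Name) {
  if (C.peek().Kind == TokKind::Percent)
    C.lex(); // Eat the optional '%'.
  if (C.peek().Kind != TokKind::Identifier)
    return true;
  Name = C.peek().Text;
  C.lex(); // Eat the identifier.
  return false;
}
```

With this shape, a token stream of '%' followed by r12 and a stream containing only lr both match, while '%' followed by a non-identifier is rejected. A caller that still eats the '%' before calling the matcher keeps working too, because the check inside the matcher is conditional.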

In D91735#2403921, @nickdesaulniers wrote:

I was thinking "would it be better to move this logic down into PPCAsmParser::MatchRegisterName?" Well, I suppose that depends on whether the % prefix on registers is valid in other contexts, for instance in other places that call PPCAsmParser::MatchRegisterName directly rather than PPCAsmParser::tryParseRegister, which you've modified here.

When I do that, I get several register parsing failures, including for this testcase. I wanted to restrict this change to the CFI directives (this function seems to be used by those instead of normal functions). X86 has similar code in the same place.

I think what may be happening is that the MatchRegisterName function is looking at a token that has sometimes already been lexed, and so isn't always "current" when the function is called.

In D91735#2404079, @void wrote:

In D91735#2403921, @nickdesaulniers wrote:

I was thinking "would it be better to move this logic down into PPCAsmParser::MatchRegisterName?" Well, I suppose that depends on whether the % prefix on registers is valid in other contexts, for instance in other places that call PPCAsmParser::MatchRegisterName directly rather than PPCAsmParser::tryParseRegister, which you've modified here.

When I do that, I get several register parsing failures, including for this testcase. I wanted to restrict this change to the CFI directives (this function seems to be used by those instead of normal functions). X86 has similar code in the same place.

Oh?

diff --git a/llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp b/llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp
index 458edf71d6c8..5e4d17ec9a13 100644
--- a/llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp
+++ b/llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp
@@ -1201,6 +1201,8 @@ bool PPCAsmParser::MatchAndEmitInstruction(SMLoc IDLoc, unsigned &Opcode,
 }
 
 bool PPCAsmParser::MatchRegisterName(unsigned &RegNo, int64_t &IntVal) {
+  if (getParser().getTok().is(AsmToken::Percent))
+    getParser().Lex(); // Eat the '%'.
   if (getParser().getTok().is(AsmToken::Identifier)) {
     StringRef Name = getParser().getTok().getString();
     if (Name.equals_lower("lr")) {
@@ -1481,7 +1483,6 @@ bool PPCAsmParser::ParseOperand(OperandVector &Operands) {
   // Special handling for register names.  These are interpreted
   // as immediates corresponding to the register number.
   case AsmToken::Percent:
-    Parser.Lex(); // Eat the '%'.
     unsigned RegNo;
     int64_t IntVal;
     if (MatchRegisterName(RegNo, IntVal))
@@ -1552,7 +1553,6 @@ bool PPCAsmParser::ParseOperand(OperandVector &Operands) {
     int64_t IntVal;
     switch (getLexer().getKind()) {
     case AsmToken::Percent:
-      Parser.Lex(); // Eat the '%'.
       unsigned RegNo;
       if (MatchRegisterName(RegNo, IntVal))
         return Error(S, "invalid register name");

passes all tests for me, other than this newly added one; for some reason we get an additional newline before the blr instruction:

llvm-mc -triple powerpc64le-unknown-unknown /android0/llvm-project/llvm/test/MC/PowerPC/cfi-register-directive-parse.s
        .text



        .globl  __test1
__test1:
        .cfi_startproc
        mflr    12
        .cfi_register lr, r12

        blr
        .cfi_endproc

(Though maybe that's a bug in MCAsmStreamer for the assembler .cfi directives, or how registers are printed? Maybe DwarfRegNumForCFI is related?)

PPCAsmParser::ParseOperand also has a comment that %rNN is used for ELF but not Mach-O; maybe this should be conditioned on isDarwin()? Though that comment and the current implementation of PPCAsmParser::ParseOperand don't look like they match to me. Also, the printout above is curious to me: r12 gets printed without the r prefix for mflr. LLVM looks a little inconsistent here in how PPC asm register operands are printed.

Move % eater to MatchRegisterName.

I'm not sure how relevant ppc Darwin is anymore. If anyone cares, we can always make the % token consumption conditional.

This revision is now accepted and ready to land. Nov 19 2020, 5:33 PM

In D91735#2406987, @nickdesaulniers wrote:

I'm not sure how relevant ppc Darwin is anymore. If anyone cares, we can always make the % token consumption conditional.

Nobody cares. ppc Darwin has been deleted (I deleted a large portion of it too...)

MaskRay added inline comments. Nov 19 2020, 5:39 PM

llvm/test/MC/PowerPC/cfi-register-directive-parse.s
6
11	.text is redundant.

MaskRay accepted this revision. Nov 19 2020, 5:41 PM

This revision was landed with ongoing or failed builds. Nov 19 2020, 6:20 PM

Closed by commit rGb2f663073917: [PowerPC] Allow a '%' prefix for registers in CFI directives (authored by void).

This revision was automatically updated to reflect the committed changes.

void added a commit: rGb2f663073917: [PowerPC] Allow a '%' prefix for registers in CFI directives.

Harbormaster completed remote builds in B79546: Diff 306566. Nov 19 2020, 6:34 PM

Revision Contents

Path                                                   Size
llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp     77 lines
llvm/test/MC/PowerPC/cfi-register-directive-parse.s    16 lines

Diff 306578

llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp

[1,195 lines not shown; context: case Match_InvalidOperand: {]
     return Error(ErrorLoc, "invalid operand for instruction");
   }
   }
 
   llvm_unreachable("Implement any new match types added!");
 }
 
 bool PPCAsmParser::MatchRegisterName(unsigned &RegNo, int64_t &IntVal) {
-  if (getParser().getTok().is(AsmToken::Identifier)) {
-    StringRef Name = getParser().getTok().getString();
-    if (Name.equals_lower("lr")) {
-      RegNo = isPPC64()? PPC::LR8 : PPC::LR;
-      IntVal = 8;
-    } else if (Name.equals_lower("ctr")) {
-      RegNo = isPPC64()? PPC::CTR8 : PPC::CTR;
-      IntVal = 9;
-    } else if (Name.equals_lower("vrsave")) {
-      RegNo = PPC::VRSAVE;
-      IntVal = 256;
-    } else if (Name.startswith_lower("r") &&
-               !Name.substr(1).getAsInteger(10, IntVal) && IntVal < 32) {
-      RegNo = isPPC64()? XRegs[IntVal] : RRegs[IntVal];
-    } else if (Name.startswith_lower("f") &&
-               !Name.substr(1).getAsInteger(10, IntVal) && IntVal < 32) {
-      RegNo = FRegs[IntVal];
-    } else if (Name.startswith_lower("vs") &&
-               !Name.substr(2).getAsInteger(10, IntVal) && IntVal < 64) {
-      RegNo = VSRegs[IntVal];
-    } else if (Name.startswith_lower("v") &&
-               !Name.substr(1).getAsInteger(10, IntVal) && IntVal < 32) {
-      RegNo = VRegs[IntVal];
-    } else if (Name.startswith_lower("cr") &&
-               !Name.substr(2).getAsInteger(10, IntVal) && IntVal < 8) {
-      RegNo = CRRegs[IntVal];
-    } else
-      return true;
-    getParser().Lex();
-    return false;
-  }
-  return true;
+  if (getParser().getTok().is(AsmToken::Percent))
+    getParser().Lex(); // Eat the '%'.
+
+  if (!getParser().getTok().is(AsmToken::Identifier))
+    return true;
+
+  StringRef Name = getParser().getTok().getString();
+  if (Name.equals_lower("lr")) {
+    RegNo = isPPC64() ? PPC::LR8 : PPC::LR;
+    IntVal = 8;
+  } else if (Name.equals_lower("ctr")) {
+    RegNo = isPPC64() ? PPC::CTR8 : PPC::CTR;
+    IntVal = 9;
+  } else if (Name.equals_lower("vrsave")) {
+    RegNo = PPC::VRSAVE;
+    IntVal = 256;
+  } else if (Name.startswith_lower("r") &&
+             !Name.substr(1).getAsInteger(10, IntVal) && IntVal < 32) {
+    RegNo = isPPC64() ? XRegs[IntVal] : RRegs[IntVal];
+  } else if (Name.startswith_lower("f") &&
+             !Name.substr(1).getAsInteger(10, IntVal) && IntVal < 32) {
+    RegNo = FRegs[IntVal];
+  } else if (Name.startswith_lower("vs") &&
+             !Name.substr(2).getAsInteger(10, IntVal) && IntVal < 64) {
+    RegNo = VSRegs[IntVal];
+  } else if (Name.startswith_lower("v") &&
+             !Name.substr(1).getAsInteger(10, IntVal) && IntVal < 32) {
+    RegNo = VRegs[IntVal];
+  } else if (Name.startswith_lower("cr") &&
+             !Name.substr(2).getAsInteger(10, IntVal) && IntVal < 8) {
+    RegNo = CRRegs[IntVal];
+  } else
+    return true;
+  getParser().Lex();
+  return false;
 }
 
 bool PPCAsmParser::
 ParseRegister(unsigned &RegNo, SMLoc &StartLoc, SMLoc &EndLoc) {
   if (tryParseRegister(RegNo, StartLoc, EndLoc) != MatchOperand_Success)
     return TokError("invalid register name");
   return false;
 }
 
[230 lines not shown; context: bool PPCAsmParser::ParseOperand(OperandVector &Operands) {]
   SMLoc S = Parser.getTok().getLoc();
   SMLoc E = SMLoc::getFromPointer(Parser.getTok().getLoc().getPointer() - 1);
   const MCExpr *EVal;
 
   // Attempt to parse the next token as an immediate
   switch (getLexer().getKind()) {
   // Special handling for register names.  These are interpreted
   // as immediates corresponding to the register number.
-  case AsmToken::Percent:
-    Parser.Lex(); // Eat the '%'.
+  case AsmToken::Percent: {
     unsigned RegNo;
     int64_t IntVal;
     if (MatchRegisterName(RegNo, IntVal))
       return Error(S, "invalid register name");
 
     Operands.push_back(PPCOperand::CreateImm(IntVal, S, E, isPPC64()));
     return false;
+  }
   case AsmToken::Identifier:
   case AsmToken::LParen:
   case AsmToken::Plus:
   case AsmToken::Minus:
   case AsmToken::Integer:
   case AsmToken::Dot:
   case AsmToken::Dollar:
   case AsmToken::Exclaim:
[45 lines not shown; context: bool PPCAsmParser::ParseOperand(OperandVector &Operands) {]
 
   // Otherwise, check for D-form memory operands
   if (!TLSCall && getLexer().is(AsmToken::LParen)) {
     Parser.Lex(); // Eat the '('.
     S = Parser.getTok().getLoc();
 
     int64_t IntVal;
     switch (getLexer().getKind()) {
-    case AsmToken::Percent:
-      Parser.Lex(); // Eat the '%'.
+    case AsmToken::Percent: {
       unsigned RegNo;
       if (MatchRegisterName(RegNo, IntVal))
         return Error(S, "invalid register name");
       break;
+    }
     case AsmToken::Integer:
       if (isDarwin())
         return Error(S, "unexpected integer value");
       else if (getParser().parseAbsoluteExpression(IntVal) || IntVal < 0 ||
                IntVal > 31)
         return Error(S, "invalid register number");
       break;
     case AsmToken::Identifier:
[320 lines not shown]
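
The name-to-number mapping that MatchRegisterName performs can be tried outside LLVM with a small sketch. The numeric values and range limits (lr is 8, ctr is 9, vrsave is 256, and the bounds 32, 64, and 8) are taken from the diff above. Everything else is an assumption made for illustration: matchRegisterNumber, parseIndex, and startsWith are hypothetical helpers, std::string replaces StringRef, the RegNo lookup tables are omitted, and only plain decimal digits are accepted after the prefix.

```cpp
#include <cctype>
#include <cstddef>
#include <cstdint>
#include <string>

// Parses the decimal digits of Name starting at index Start into Val.
// Succeeds only if there is at least one character and all are digits.
static bool parseIndex(const std::string &Name, std::size_t Start,
                       std::int64_t &Val) {
  if (Start >= Name.size())
    return false;
  std::int64_t Result = 0;
  for (std::size_t I = Start; I < Name.size(); ++I) {
    unsigned char Ch = static_cast<unsigned char>(Name[I]);
    if (!std::isdigit(Ch))
      return false;
    Result = Result * 10 + (Ch - '0');
    if (Result > 1000) // Far outside any register range; avoids overflow.
      return false;
  }
  Val = Result;
  return true;
}

static bool startsWith(const std::string &Name, const std::string &Prefix) {
  return Name.compare(0, Prefix.size(), Prefix) == 0;
}

// Returns true on failure, like the function in the diff. On failure the
// value left in IntVal is unspecified.
bool matchRegisterNumber(std::string Name, std::int64_t &IntVal) {
  for (char &Ch : Name) // Compare case-insensitively.
    Ch = static_cast<char>(std::tolower(static_cast<unsigned char>(Ch)));

  if (Name == "lr") {
    IntVal = 8;
  } else if (Name == "ctr") {
    IntVal = 9;
  } else if (Name == "vrsave") {
    IntVal = 256;
  } else if (startsWith(Name, "r") && parseIndex(Name, 1, IntVal) &&
             IntVal < 32) {
  } else if (startsWith(Name, "f") && parseIndex(Name, 1, IntVal) &&
             IntVal < 32) {
  } else if (startsWith(Name, "vs") && parseIndex(Name, 2, IntVal) &&
             IntVal < 64) {
  } else if (startsWith(Name, "v") && parseIndex(Name, 1, IntVal) &&
             IntVal < 32) {
  } else if (startsWith(Name, "cr") && parseIndex(Name, 2, IntVal) &&
             IntVal < 8) {
  } else {
    return true;
  }
  return false;
}
```

Under these assumptions, r12 maps to 12, vs63 to 63, and cr7 to 7, while r32 and cr8 are rejected because they fall outside the ranges checked in the diff.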

llvm/test/MC/PowerPC/cfi-register-directive-parse.s

This file was added.

# RUN: llvm-mc -triple powerpc64le-unknown-unknown %s 2>&1 | FileCheck %s

# Test that CFI directives can handle registers with a '%' prefix.

# CHECK-LABEL: __test1
# CHECK: .cfi_startproc
# CHECK-NEXT: mflr 12
# CHECK-NEXT: .cfi_register lr, r12

.globl __test1
__test1:
.cfi_startproc
mflr %r12
.cfi_register lr,%r12
blr
.cfi_endproc

Inline comment by MaskRay on line 6 (Unsubmitted, Not Done):

# Test that CFI directives can handle registers with a '%' prefix.
- # CHECK-LABEL: __test1
+ # CHECK-LABEL: __test1:
# CHECK: .cfi_startproc

Inline comment by MaskRay on line 11 (Unsubmitted, Not Done):

.text is redundant.
