This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/TableGen/
-
TableGen/
1/3
TGParser.cpp
-
test/TableGen/
-
TableGen/
-
infix-add.td

Differential D68453

TableGen: Allow 'a+b' in TableGen language
AbandonedPublic

Authored by javed.absar on Oct 4 2019, 3:15 AM.

Download Raw Diff

Details

Reviewers

lebedev.ri
arsenm
RKSimon

Summary

!add(a,b) is often used in td files but its bit cumbersome.
This patch allows one to write 'a+b'. !add(a,b) will still work.

Diff Detail

Event Timeline

javed.absar created this revision.Oct 4 2019, 3:15 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 4 2019, 3:15 AM

Herald added subscribers: hiraditya, wdng. · View Herald Transcript

How far down this road can we go? I would like to see infix operators in TableGen because I believe that it will make it easier to use. However, I'm concerned about just adding add as a special case. Of the other similar operators:

case ADD: Result = "!add"; break;
case MUL: Result = "!mul"; break;
case AND: Result = "!and"; break;
case OR: Result = "!or"; break;
case SHL: Result = "!shl"; break;
case SRA: Result = "!sra"; break;
case SRL: Result = "!srl"; break;
case EQ: Result = "!eq"; break;
case NE: Result = "!ne"; break;
case LE: Result = "!le"; break;
case LT: Result = "!lt"; break;
case GE: Result = "!ge"; break;
case GT: Result = "!gt"; break;

how many can we add as infix operators without ambiguities or other difficulties?

llvm/lib/TableGen/TGParser.cpp
2104	This syntax comment here needs to be updated.

In D68453#1694815, @hfinkel wrote:
How far down this road can we go? I would like to see infix operators in TableGen because I believe that it will make it easier to use. However, I'm concerned about just adding add as a special case. Of the other similar operators:
case ADD: Result = "!add"; break;
case MUL: Result = "!mul"; break;
case AND: Result = "!and"; break;
case OR: Result = "!or"; break;
case SHL: Result = "!shl"; break;
case SRA: Result = "!sra"; break;
case SRL: Result = "!srl"; break;
case EQ: Result = "!eq"; break;
case NE: Result = "!ne"; break;
case LE: Result = "!le"; break;
case LT: Result = "!lt"; break;
case GE: Result = "!ge"; break;
case GT: Result = "!gt"; break;
how many can we add as infix operators without ambiguities or other difficulties?

I have a half baked shunting-algorithm based infix-operator patch to work with cases you mention but that is kind of major work. I thought maybe, as !add is most commonly used, there maybe an appetite for a simple solution like here.

In D68453#1695803, @javed.absar wrote:
In D68453#1694815, @hfinkel wrote:
How far down this road can we go? I would like to see infix operators in TableGen because I believe that it will make it easier to use. However, I'm concerned about just adding add as a special case. Of the other similar operators:
case ADD: Result = "!add"; break;
case MUL: Result = "!mul"; break;
case AND: Result = "!and"; break;
case OR: Result = "!or"; break;
case SHL: Result = "!shl"; break;
case SRA: Result = "!sra"; break;
case SRL: Result = "!srl"; break;
case EQ: Result = "!eq"; break;
case NE: Result = "!ne"; break;
case LE: Result = "!le"; break;
case LT: Result = "!lt"; break;
case GE: Result = "!ge"; break;
case GT: Result = "!gt"; break;
how many can we add as infix operators without ambiguities or other difficulties?
I have a half baked shunting-algorithm based infix-operator patch to work with cases you mention but that is kind of major work. I thought maybe, as !add is most commonly used, there maybe an appetite for a simple solution like here.

To be clear, I'm fine with taking an incremental approach here. It sounds like you've investigated this and there's no major technical impediment, it's just more complicated (and, thus, a more-complicated patch) to deal with all of the operators. Is that correct?

In D68453#1695855, @hfinkel wrote:
In D68453#1695803, @javed.absar wrote:
In D68453#1694815, @hfinkel wrote:
How far down this road can we go? I would like to see infix operators in TableGen because I believe that it will make it easier to use. However, I'm concerned about just adding add as a special case. Of the other similar operators:
case ADD: Result = "!add"; break;
case MUL: Result = "!mul"; break;
case AND: Result = "!and"; break;
case OR: Result = "!or"; break;
case SHL: Result = "!shl"; break;
case SRA: Result = "!sra"; break;
case SRL: Result = "!srl"; break;
case EQ: Result = "!eq"; break;
case NE: Result = "!ne"; break;
case LE: Result = "!le"; break;
case LT: Result = "!lt"; break;
case GE: Result = "!ge"; break;
case GT: Result = "!gt"; break;
how many can we add as infix operators without ambiguities or other difficulties?
I have a half baked shunting-algorithm based infix-operator patch to work with cases you mention but that is kind of major work. I thought maybe, as !add is most commonly used, there maybe an appetite for a simple solution like here.
To be clear, I'm fine with taking an incremental approach here. It sounds like you've investigated this and there's no major technical impediment, it's just more complicated (and, thus, a more-complicated patch) to deal with all of the operators. Is that correct?

absolutely right, Hal.

This doesn't raise any concerns with me but i don't believe i'm most qualified to review this :)
Missing llvm/docs/TableGen/LangIntro.rst, llvm/docs/TableGen/LangRef.rst, llvm/utils/kate/llvm-tablegen.xml changes.

llvm/lib/TableGen/TGParser.cpp
2185–2187	Do you want to make any sanity checks about types of lhs, rhs, final type?

This does seem like a useful addition (heh) to the grammar. There is one thing that can go wrong in the future: parentheses are already reserved for DAGs. Brackets and braces also already have their own purpose. We could designate (( without space for parenthesis, with the risk that ( ( and (( can mean different things, with the first one making sense in DAGs. Though DAG operators are usually defs, so the risk of weirdness may be acceptable.

I also agree that a shunting-based algorithm is the correct direction to go in long-term, and I'm concerned about adding code that adds momentum in the wrong direction. Right now, it's still easy to turn back and do things right, but if we add the change as-is the temptation will be great for the next person to come along and add even more code that goes off into the woods. Can we please get this right now? We can treat # and + as left-associative operators of the same precedence right now (and don't add anything else), which should keep things somewhat simpler, but at least we should be able to avoid using recursion for parsing the RHS.

llvm/lib/TableGen/TGParser.cpp
2185–2187	Yes. `resolveTypes` should be used on the LHS and RHS types and the result used as the type. That makes it consistent with `!add` handling. You don't particularly need to check that it's an IntRecTy as far as I'm concerned.

(outstanding review notes not addressed)

This revision now requires changes to proceed.Oct 16 2019, 10:55 AM

updated based on review comments

Thanks. Any thoughts on the higher-level algorithmic questions?

Abandoning as I think I will try a more holistic approach.

Revision Contents

Path

Size

llvm/

lib/

TableGen/

TGParser.cpp

23 lines

test/

TableGen/

infix-add.td

28 lines

Diff 225313

llvm/lib/TableGen/TGParser.cpp

Show First 20 Lines • Show All 2,094 Lines • ▼ Show 20 Lines
}		}

/// ParseValue - Parse a tblgen value. This returns null on error.		/// ParseValue - Parse a tblgen value. This returns null on error.
///		///
/// Value ::= SimpleValue ValueSuffix*		/// Value ::= SimpleValue ValueSuffix*
/// ValueSuffix ::= '{' BitList '}'		/// ValueSuffix ::= '{' BitList '}'
/// ValueSuffix ::= '[' BitList ']'		/// ValueSuffix ::= '[' BitList ']'
/// ValueSuffix ::= '.' ID		/// ValueSuffix ::= '.' ID
		/// ValueSuffix ::= '+' SimpleValue
///		///
		hfinkelUnsubmitted Done Reply Inline Actions This syntax comment here needs to be updated. hfinkel: This syntax comment here needs to be updated.
Init TGParser::ParseValue(Record CurRec, RecTy *ItemType, IDParseMode Mode) {		Init TGParser::ParseValue(Record CurRec, RecTy *ItemType, IDParseMode Mode) {
Init *Result = ParseSimpleValue(CurRec, ItemType, Mode);		Init *Result = ParseSimpleValue(CurRec, ItemType, Mode);
if (!Result) return nullptr;		if (!Result) return nullptr;

// Parse the suffixes now if present.		// Parse the suffixes now if present.
while (true) {		while (true) {
switch (Lex.getCode()) {		switch (Lex.getCode()) {
default: return Result;		default: return Result;
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	case tgtok::period: {
Result->getAsString() + "'");		Result->getAsString() + "'");
return nullptr;		return nullptr;
}		}
Result = FieldInit::get(Result, FieldName)->Fold(CurRec);		Result = FieldInit::get(Result, FieldName)->Fold(CurRec);
Lex.Lex(); // eat field name		Lex.Lex(); // eat field name
break;		break;
}		}

		case tgtok::plus: {
		SMLoc PlusLoc = Lex.getLoc();

		TypedInit *LHSt = dyn_cast<TypedInit>(Result);
		if (!LHSt) {
		Error(PlusLoc, "LHS of '+' is not typed!");
		return nullptr;
		}
		Lex.Lex(); // Eat the '+'
		Init *RHS = ParseValue(CurRec, ItemType, ParseNameMode);
		TypedInit *RHSt = dyn_cast<TypedInit>(RHS);
		lebedev.riUnsubmitted Not Done Reply Inline Actions Do you want to make any sanity checks about types of lhs, rhs, final type? lebedev.ri: Do you want to make any sanity checks about types of lhs, rhs, final type?
		nhaehnleUnsubmitted Not Done Reply Inline Actions Yes. `resolveTypes` should be used on the LHS and RHS types and the result used as the type. That makes it consistent with `!add` handling. You don't particularly need to check that it's an IntRecTy as far as I'm concerned. nhaehnle: Yes. `resolveTypes` should be used on the LHS and RHS types and the result used as the type.
		if (!RHSt) {
		Error(PlusLoc, "RHS of '+' is not typed!");
		return nullptr;
		}

		RecTy *ResolvedTy = resolveTypes(LHSt->getType(), RHSt->getType());
		Result = BinOpInit::get(BinOpInit::ADD, LHSt, RHSt,
		ResolvedTy)->Fold(CurRec);
		break;
		}

case tgtok::paste:		case tgtok::paste:
SMLoc PasteLoc = Lex.getLoc();		SMLoc PasteLoc = Lex.getLoc();
TypedInit *LHS = dyn_cast<TypedInit>(Result);		TypedInit *LHS = dyn_cast<TypedInit>(Result);
if (!LHS) {		if (!LHS) {
Error(PasteLoc, "LHS of paste is not typed!");		Error(PasteLoc, "LHS of paste is not typed!");
return nullptr;		return nullptr;
}		}

▲ Show 20 Lines • Show All 1,060 Lines • Show Last 20 Lines

llvm/test/TableGen/infix-add.td

This file was added.

				// RUN: llvm-tblgen %s \| FileCheck %s
				// XFAIL: vg_leak

				class Int<int v> {
				int Val = v;
				}

				// CHECK: def AA1 {
				// CHECK-NEXT: int Val = 1;
				// CHECK: def AB1 {
				// CHECK-NEXT: int Val = 11;
				foreach Index = 1-1 in {
				def AA#Index : Int<Index>;
				def AB#Index : Int<Index + 10>;
				}


				// CHECK: def I20 {
				// CHECK-NEXT : int Val = 20;
				def I20 : Int<20>;

				// CHECK: def I21 {
				// CHECK-NEXT : int Val = 21;
				def I21 : Int<!if(!eq(I20.Val,20), I20.Val + 1, 22)>;

				// CHECK: def I300 {
				// CHECK-NEXT: int Val = 300;
				def I300 : Int<155 + 145>;