diff --git a/llvm/utils/TableGen/jupyter/tablegen_tutorial_part_1.ipynb b/llvm/utils/TableGen/jupyter/tablegen_tutorial_part_1.ipynb new file mode 100644 --- /dev/null +++ b/llvm/utils/TableGen/jupyter/tablegen_tutorial_part_1.ipynb @@ -0,0 +1,1125 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "id": "b1123484", + "metadata": {}, + "source": [ + "## Intro to TableGen Part 1: Classes, Defs, Basic Types and Let" + ] + }, + { + "cell_type": "markdown", + "id": "15a8d57a", + "metadata": {}, + "source": [ + "**Note:** The content in this notebook is adapted from https://llvm.org/docs/TableGen/index.html.\n", + "\n", + "This tutorial will cover:\n", + "* Classes\n", + "* Defs\n", + "* Basic types\n", + "* `let` in various forms\n", + "\n", + "For other topics, see the rest of the notebooks in this same folder." + ] + }, + { + "cell_type": "markdown", + "id": "fbfb640e", + "metadata": {}, + "source": [ + "## What is TableGen?" + ] + }, + { + "cell_type": "markdown", + "id": "4b7d9c3e", + "metadata": {}, + "source": [ + "TableGen is a language used in LLVM to automate the generation of certain types of code. Usually repetitive code that has a common structure. TableGen is used to generate \"records\" that are then processed by a \"backend\" into domain specific code.\n", + "\n", + "The compiler for TableGen is the binary `llvm-tblgen`. This contains the logic to convert TableGen source into records that can then be passed to a TableGen backend.\n", + "\n", + "TableGen allows you to define Classes and Defs (which are instances of classes) but it doesn't encode what to do with that structure. That's what the backend does. The backend converts this structure into something useful, for example C++ code.\n", + "\n", + "These backends are included in the `llvm-tblgen` binary and you can choose which one to run using a command line option. If you don't choose a backend you get a dump of the structure, and that is what this notebook will be showing.\n", + "\n", + "This tutorial will focus on the language itself only. The only thing you need to know now is that in addition to `llvm-tblgen` you will see other `*-tblgen` like `clang-tblgen`. The difference between them is the backends they include.\n", + "\n", + "The default output from `llvm-tblgen` looks like this:" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "id": "88331eb7", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "------------- Defs -----------------\n" + ] + } + ], + "source": [ + "%reset\n", + "// Empty source file" + ] + }, + { + "cell_type": "markdown", + "id": "58f4b3e8", + "metadata": {}, + "source": [ + "**Note:** `%reset` is not a TableGen command but a \"magic\" command to the Jupyter kernel for this notebook. It tells it to clear the cache of recent cell content and start fresh.\n", + "\n", + "No source means no classes and no defs. Let's add a class." + ] + }, + { + "cell_type": "markdown", + "id": "da055b37", + "metadata": {}, + "source": [ + "## Classes" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "id": "249f27bf", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + "}\n", + "------------- Defs -----------------\n" + ] + } + ], + "source": [ + "class C {}" + ] + }, + { + "cell_type": "markdown", + "id": "e92f5c84", + "metadata": {}, + "source": [ + "Followed by a def (definition)." + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "id": "2a101610", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + "}\n", + "------------- Defs -----------------\n", + "def X {\t// C\n", + "}\n" + ] + } + ], + "source": [ + "def X: C;" + ] + }, + { + "cell_type": "markdown", + "id": "6b1b79a5", + "metadata": {}, + "source": [ + "`def` creates an instance of a class. Typically, the main loop of a TableGen backend will look for all defs that are instances of a certain class.\n", + "\n", + "For example if I am generating register information I would look for all defs that are instances of `RegisterInfo` in the example below." + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "id": "964e9ba5", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class RegisterInfo {\n", + "}\n", + "------------- Defs -----------------\n", + "def X0 {\t// RegisterInfo\n", + "}\n", + "def X1 {\t// RegisterInfo\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class RegisterInfo {}\n", + "def X0: RegisterInfo {}\n", + "def X1: RegisterInfo {}" + ] + }, + { + "cell_type": "markdown", + "id": "211bb409", + "metadata": {}, + "source": [ + "## Inheritance" + ] + }, + { + "cell_type": "markdown", + "id": "b2ef33b7", + "metadata": {}, + "source": [ + "Like many other languages with classes, a class in TableGen can inherit properties of another class." + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "id": "b3c23664", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + "}\n", + "class D {\t// C\n", + "}\n", + "------------- Defs -----------------\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {}\n", + "class D : C {}" + ] + }, + { + "cell_type": "markdown", + "id": "c85bfa75", + "metadata": {}, + "source": [ + "Inheritance is done by putting the class you want to inherit from after `:`, before the opening `{`.\n", + "\n", + "You'll know that `D` inherits from `C` by the `// C` comment on the `class D {` line in the output.\n", + "\n", + "Not very interesting though, what are we actually inheriting? The members of the parent class." + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "id": "eb8e210e", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = ?;\n", + "}\n", + "class D {\t// C\n", + " int a = ?;\n", + "}\n", + "------------- Defs -----------------\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a;\n", + "}\n", + "class D : C {}" + ] + }, + { + "cell_type": "markdown", + "id": "224caf2d", + "metadata": {}, + "source": [ + "Note that `D` now has the `a` member which was defined in the class `C`.\n", + "\n", + "You can inherit from multiple classes. In that case the order of that matches the order you write the class names after the `:`." + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "id": "842e12a7", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = 1;\n", + "}\n", + "class D {\n", + " int a = 2;\n", + "}\n", + "class E {\t// C D\n", + " int a = 2;\n", + "}\n", + "------------- Defs -----------------\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a = 1;\n", + "}\n", + "class D {\n", + " int a = 2;\n", + "}\n", + "class E : C, D {}" + ] + }, + { + "cell_type": "markdown", + "id": "de372733", + "metadata": {}, + "source": [ + "Class `E` first inherits from class `C`. This gives `E` a member `a` with value `1`. Then it inherits from class `D` which also has a member `a` but with a value of `2`. Meaning the final value of `E`'s `a` is `2`.\n", + "\n", + "When a member has the same name this is handled on a \"last one in wins\" basis. Assuming the types match." + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "id": "54782b14", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + ":7:14: error: New definition of 'a' of type 'int' is incompatible with previous definition of type 'string'\n", + "class E : C, D {}\n", + " ^\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " string a = \"\";\n", + "}\n", + "class D {\n", + " int a = 2;\n", + "}\n", + "class E : C, D {}" + ] + }, + { + "cell_type": "markdown", + "id": "538264ff", + "metadata": {}, + "source": [ + "When they don't match, we get an error. Luckily for us, we're about to learn all about types." + ] + }, + { + "cell_type": "markdown", + "id": "90c1d03f", + "metadata": {}, + "source": [ + "## Types" + ] + }, + { + "cell_type": "markdown", + "id": "3a7b4496", + "metadata": {}, + "source": [ + "TableGen is statically typed with error checking if you try to assign things with mismatched types." + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "id": "232a8bdf", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = ?;\n", + " bit b = 0;\n", + " string s = \"Hello\";\n", + "}\n", + "------------- Defs -----------------\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a;\n", + " bit b = 0;\n", + " string s = \"Hello\";\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "c2c6203b", + "metadata": {}, + "source": [ + "Here we've created a class C with integer, bit (1 or 0) and string members. See https://llvm.org/docs/TableGen/ProgRef.html#types for a full list of types.\n", + "\n", + "Note that you do not have to give a member a default value, it can be left uninitialised." + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "id": "9f8149cb", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = ?;\n", + " bit b = 0;\n", + " string s = \"Hello\";\n", + "}\n", + "------------- Defs -----------------\n", + "def X {\t// C\n", + " int a = ?;\n", + " bit b = 0;\n", + " string s = \"Hello\";\n", + "}\n" + ] + } + ], + "source": [ + "def X: C {}" + ] + }, + { + "cell_type": "markdown", + "id": "79ead3c0", + "metadata": {}, + "source": [ + "When you make an instance of a class using `def`, that instance gets all the members of the class. Their values will be as set in the class, unless otherwise overridden.\n", + "\n", + "In the case of `a` it also keeps the undefined value. Any backend using that definition would have to check for that case." + ] + }, + { + "cell_type": "code", + "execution_count": 11, + "id": "447e20b6", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + ":8:13: error: Field 'a' of type 'int' is incompatible with value '\"abc\"' of type 'string'\n", + " int a = \"abc\"\n", + " ^\n", + ":9:1: error: expected ';' after declaration\n", + "}\n", + "^\n" + ] + } + ], + "source": [ + "def Y {\n", + " int a = \"abc\"\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "0cbe813f", + "metadata": {}, + "source": [ + "Here we see the type checking in action. Member `a` has type `int` so we cannot assign a `string` to it." + ] + }, + { + "cell_type": "markdown", + "id": "0db796fc", + "metadata": {}, + "source": [ + "## Let" + ] + }, + { + "cell_type": "markdown", + "id": "d3c41506", + "metadata": {}, + "source": [ + "If we want to override those member values we can use `let` (https://llvm.org/docs/TableGen/ProgRef.html#let-override-fields-in-classes-or-records). This can be done in a couple of ways. The first is where you mark the scope of the `let` using `in {}`.\n", + "\n", + "`let = in {`\n", + "\n", + "The code below says that within the `{}` after the `let`, all `a` should have the value 5." + ] + }, + { + "cell_type": "code", + "execution_count": 12, + "id": "296a5d9c", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = 9;\n", + "}\n", + "------------- Defs -----------------\n", + "def X {\t// C\n", + " int a = 5;\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a = 9;\n", + "}\n", + "let a=5 in {\n", + " def X: C {}\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "1bcceae5", + "metadata": {}, + "source": [ + "For multiple names, separate them with a comma." + ] + }, + { + "cell_type": "code", + "execution_count": 25, + "id": "3b640bcb", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = ?;\n", + " int b = ?;\n", + "}\n", + "------------- Defs -----------------\n", + "def X {\t// C\n", + " int a = 5;\n", + " int b = 6;\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a;\n", + " int b;\n", + "}\n", + "let a=5, b=6 in {\n", + " def X: C {}\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "89b5cdd9", + "metadata": {}, + "source": [ + "You can also use `let` within a `def`. This means the scope of the `let` is the same as the scope of the `def` (its `{...}`)." + ] + }, + { + "cell_type": "code", + "execution_count": 26, + "id": "826a5449", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = 9;\n", + "}\n", + "------------- Defs -----------------\n", + "def X {\t// C\n", + " int a = 5;\n", + "}\n", + "def Y {\t// C\n", + " int a = 9;\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a = 9;\n", + "}\n", + "def X: C {\n", + " let a=5;\n", + "}\n", + "def Y: C {}" + ] + }, + { + "cell_type": "markdown", + "id": "ad57c118", + "metadata": {}, + "source": [ + "Note that `Y` has `a` as `9` because the `let` was only applied to `X`.\n", + "\n", + "It is an error to try to `let` a name that hasn't been defined or to give it a value of the incorrect type." + ] + }, + { + "cell_type": "code", + "execution_count": 14, + "id": "ddc86443", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + ":5:9: error: Field 'a' of type 'int' is incompatible with value '\"Hello\"' of type 'string'\n", + " let a=\"Hello\";\n", + " ^\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a = 9;\n", + "}\n", + "def X: C {\n", + " let a=\"Hello\";\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "4f720f3d", + "metadata": {}, + "source": [ + "Above, the member `a` was defined but with a type of `int`. We therefore cannot `let` it have a value of type `string`." + ] + }, + { + "cell_type": "code", + "execution_count": 15, + "id": "2748d14c", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + ":5:11: error: Value 'b' unknown!\n", + " let b=5;\n", + " ^\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a = 9;\n", + "}\n", + "def X: C {\n", + " let b=5;\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "a8b89546", + "metadata": {}, + "source": [ + "Above, class `C` only has one member, `a`. Therefore we get an error trying to override the value of `b` which doesn't exist.\n", + "\n", + "If you have multiple let, the outer scope is applied first then on down to the narrowest scope." + ] + }, + { + "cell_type": "code", + "execution_count": 27, + "id": "92e3bb40", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int a = 9;\n", + "}\n", + "------------- Defs -----------------\n", + "def X {\t// C\n", + " int a = 6;\n", + "}\n", + "def Y {\t// C\n", + " int a = 5;\n", + "}\n", + "def Z {\t// C\n", + " int a = 3;\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int a=9;\n", + "}\n", + "let a=5 in {\n", + " let a=6 in {\n", + " def X: C {}\n", + " }\n", + " def Y: C {}\n", + " def Z: C { let a=3; }\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "777c9390", + "metadata": {}, + "source": [ + "The first let is at what we call the \"top level\". That means the outer most scope in terms of the source code. A bit like a global variable in a C file.\n", + "\n", + "That is applied first and changes `a` from `9` to `5`. `X` is within a further let that changes `a` from `5` to `6`. `Y` however is outside that `let` so its `a` remains `5`.\n", + "\n", + "`Z` takes it a step further with a `let` inside the def. This overrides all the others to make `a` be `3` within `Z`.\n", + "\n", + "That example is quite complex just to demonstrate the feature. Let's look at something more practical.\n", + "\n", + "The use case here is that we are describing registers. The majority of which are 32 bits wide but there are some which are 64 bits wide." + ] + }, + { + "cell_type": "code", + "execution_count": 28, + "id": "626b2c19", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class Register {\n", + " int size = 4;\n", + "}\n", + "------------- Defs -----------------\n", + "def W0 {\t// Register\n", + " int size = 4;\n", + "}\n", + "def X0 {\t// Register\n", + " int size = 8;\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class Register {\n", + " int size=4;\n", + "}\n", + "let size=8 in {\n", + " def X0: Register {}\n", + " // Repeats 31 times...\n", + "}\n", + "def W0: Register {}\n", + "// Repeats 31 times..." + ] + }, + { + "cell_type": "markdown", + "id": "6b90974b", + "metadata": {}, + "source": [ + "(for anyone curious that's AArch64's register naming)\n", + "\n", + "We start by setting a default value of `size` which is 4 (4x8=32 bits) in the class `Register`. Then using a top level `let` we override that value and set it to 8 for all the 64 bit registers." + ] + }, + { + "cell_type": "markdown", + "id": "a15296c4", + "metadata": {}, + "source": [ + "## Classes As Class Members" + ] + }, + { + "cell_type": "markdown", + "id": "b8dffde4", + "metadata": {}, + "source": [ + "In addition to the built in types, class members can be user defined classes." + ] + }, + { + "cell_type": "code", + "execution_count": 29, + "id": "eaa4b9ec", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class Inner {\n", + "}\n", + "class Outer {\n", + " Inner i = ?;\n", + "}\n", + "------------- Defs -----------------\n" + ] + } + ], + "source": [ + "%reset\n", + "class Inner {}\n", + "class Outer {\n", + " Inner i;\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "1faa1c3c", + "metadata": {}, + "source": [ + "Of course that raises the question, how do we construct an instance of `Inner` to use as the value?\n", + "\n", + "We simply use a `def` like we have done before." + ] + }, + { + "cell_type": "code", + "execution_count": 33, + "id": "243afa46", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class Inner {\n", + "}\n", + "class Outer {\n", + " Inner i = AnInner;\n", + "}\n", + "------------- Defs -----------------\n", + "def AnInner {\t// Inner\n", + "}\n", + "def AnOuter {\t// Outer\n", + " Inner i = AnInner;\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class Inner {}\n", + "def AnInner: Inner {}\n", + "class Outer {\n", + " Inner i = AnInner;\n", + "}\n", + "def AnOuter: Outer {}" + ] + }, + { + "cell_type": "markdown", + "id": "a652b8e7", + "metadata": {}, + "source": [ + "## Class Template Arguments" + ] + }, + { + "cell_type": "markdown", + "id": "49ef7b63", + "metadata": {}, + "source": [ + "Class template arguments are used to pass parameters to classes when you define them." + ] + }, + { + "cell_type": "code", + "execution_count": 20, + "id": "888f1d04", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class C {\n", + " int c = C:a;\n", + " int d = C:b;\n", + "}\n", + "------------- Defs -----------------\n", + "def X {\t// C\n", + " int c = 0;\n", + " int d = 1;\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int c = a;\n", + " int d = b;\n", + "}\n", + "def X: C<0, 1> {}" + ] + }, + { + "cell_type": "markdown", + "id": "071294c5", + "metadata": {}, + "source": [ + "This means that to `def` a `C` we must now provide 2 arguments that have type `int` (type checking applies here as it does for class members).\n", + "\n", + "This is going to look familiar if you have written C++. In C++ it might look like:\n", + "```\n", + "template\n", + "class C {\n", + " int c = a;\n", + " int d = b;\n", + "};\n", + "C<0, 1> X;\n", + "```\n", + "\n", + "If templates aren't your thing, another way to think of them is as parameters to the constructor of a class. \n", + "\n", + "For instance Python code might look like this:\n", + "```\n", + "class C(object):\n", + " def __init__(self, a, b):\n", + " self.c = a\n", + " self.d = b\n", + "\n", + "print(C(1, 2).c)\n", + "# prints \"1\"\n", + "```" + ] + }, + { + "cell_type": "code", + "execution_count": 21, + "id": "eb5cb0a9", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + ":5:8: error: Value not specified for template argument 'C:b' (#1) of parent class 'C'\n", + "def X: C<0> {}\n", + " ^\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int c = a;\n", + " int d = b;\n", + "}\n", + "def X: C<0> {}" + ] + }, + { + "cell_type": "markdown", + "id": "efac82b8", + "metadata": {}, + "source": [ + "When not enough arguments are provided, you get an error.\n", + "\n", + "Below is what happens when one of those arguments is of the wrong type." + ] + }, + { + "cell_type": "code", + "execution_count": 34, + "id": "480b263f", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + ":5:8: error: Value specified for template argument 'C:b' (#1) is of type string; expected type int: \"hello\"\n", + "def X: C<0, \"hello\"> {}\n", + " ^\n" + ] + } + ], + "source": [ + "%reset\n", + "class C {\n", + " int c = a;\n", + " int d = b;\n", + "}\n", + "def X: C<0, \"hello\"> {}" + ] + }, + { + "cell_type": "markdown", + "id": "fd25ec74", + "metadata": {}, + "source": [ + "Using class template arguments you can enforce a structure on the user of the classes.\n", + "\n", + "In our previous register example I could use this to require the the user pass a value for the size.\n", + "\n", + "Classes also don't have to have all their members derived from template arguments. You can mix and match allowing some members to have default values (essentially, optional arguments to the class constructor).\n", + "\n", + "Back to the registers example. The code below makes the size argument mandatory but the alias optional." + ] + }, + { + "cell_type": "code", + "execution_count": 23, + "id": "e6d4b0e5", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------- Classes -----------------\n", + "class Register {\n", + " int size = Register:_size;\n", + " string alias = \"\";\n", + "}\n", + "------------- Defs -----------------\n", + "def X0 {\t// Register\n", + " int size = 8;\n", + " string alias = \"\";\n", + "}\n", + "def X29 {\t// Register\n", + " int size = 8;\n", + " string alias = \"frame pointer\";\n", + "}\n" + ] + } + ], + "source": [ + "%reset\n", + "class Register {\n", + " int size = _size;\n", + " string alias = \"\";\n", + "}\n", + "def X0: Register<8> {}\n", + "def X29: Register<8> {\n", + " let alias=\"frame pointer\";\n", + "}" + ] + }, + { + "cell_type": "markdown", + "id": "fa230949", + "metadata": {}, + "source": [ + "For `X0` we don't define an alias. Meaning we get the default of `\"\"` (which we'll define to mean that there's\n", + "no alias).\n", + "\n", + "For `X29` we've passed the required size as a class template argument and then added a `let` to the definition's\n", + "body to override and set an alias.\n", + "\n", + "In C++, the equivalent would be:\n", + "```\n", + "// Constructor for class Register\n", + "Register(int size, const char* alias=nullptr) :\n", + "```\n", + "\n", + "**Note:** Unlike a c++ constructor you can't reuse the name between the template argument and the class member.\n", + "Here I have added `_` to the template argument but there's no required style.\n", + "\n", + "Or Python:\n", + "```\n", + "def __init__(self, size, alias=\"\"):\n", + "```\n", + "\n", + "This concludes part 1 of this tutorial. To learn more advanced features, see the following parts which are in this same folder." + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "LLVM TableGen", + "language": "tablegen", + "name": "tablegen" + }, + "language_info": { + "file_extension": ".td", + "mimetype": "text/x-tablegen", + "name": "tablegen", + "pygments_lexer": "text" + } + }, + "nbformat": 4, + "nbformat_minor": 5 +} diff --git a/llvm/utils/TableGen/jupyter/tablegen_tutorial_part_1.md b/llvm/utils/TableGen/jupyter/tablegen_tutorial_part_1.md new file mode 100644 --- /dev/null +++ b/llvm/utils/TableGen/jupyter/tablegen_tutorial_part_1.md @@ -0,0 +1,643 @@ +## Intro to TableGen Part 1: Classes, Defs, Basic Types and Let + +**Note:** The content in this notebook is adapted from https://llvm.org/docs/TableGen/index.html. + +This tutorial will cover: +* Classes +* Defs +* Basic types +* `let` in various forms + +For other topics, see the rest of the notebooks in this same folder. + +## What is TableGen? + +TableGen is a language used in LLVM to automate the generation of certain types of code. Usually repetitive code that has a common structure. TableGen is used to generate "records" that are then processed by a "backend" into domain specific code. + +The compiler for TableGen is the binary `llvm-tblgen`. This contains the logic to convert TableGen source into records that can then be passed to a TableGen backend. + +TableGen allows you to define Classes and Defs (which are instances of classes) but it doesn't encode what to do with that structure. That's what the backend does. The backend converts this structure into something useful, for example C++ code. + +These backends are included in the `llvm-tblgen` binary and you can choose which one to run using a command line option. If you don't choose a backend you get a dump of the structure, and that is what this notebook will be showing. + +This tutorial will focus on the language itself only. The only thing you need to know now is that in addition to `llvm-tblgen` you will see other `*-tblgen` like `clang-tblgen`. The difference between them is the backends they include. + +The default output from `llvm-tblgen` looks like this: + + +```tablegen +%reset +// Empty source file +``` + + ------------- Classes ----------------- + ------------- Defs ----------------- + + +**Note:** `%reset` is not a TableGen command but a "magic" command to the Jupyter kernel for this notebook. It tells it to clear the cache of recent cell content and start fresh. + +No source means no classes and no defs. Let's add a class. + +## Classes + + +```tablegen +class C {} +``` + + ------------- Classes ----------------- + class C { + } + ------------- Defs ----------------- + + +Followed by a def (definition). + + +```tablegen +def X: C; +``` + + ------------- Classes ----------------- + class C { + } + ------------- Defs ----------------- + def X { // C + } + + +`def` creates an instance of a class. Typically, the main loop of a TableGen backend will look for all defs that are instances of a certain class. + +For example if I am generating register information I would look for all defs that are instances of `RegisterInfo` in the example below. + + +```tablegen +%reset +class RegisterInfo {} +def X0: RegisterInfo {} +def X1: RegisterInfo {} +``` + + ------------- Classes ----------------- + class RegisterInfo { + } + ------------- Defs ----------------- + def X0 { // RegisterInfo + } + def X1 { // RegisterInfo + } + + +## Inheritance + +Like many other languages with classes, a class in TableGen can inherit properties of another class. + + +```tablegen +%reset +class C {} +class D : C {} +``` + + ------------- Classes ----------------- + class C { + } + class D { // C + } + ------------- Defs ----------------- + + +Inheritance is done by putting the class you want to inherit from after `:`, before the opening `{`. + +You'll know that `D` inherits from `C` by the `// C` comment on the `class D {` line in the output. + +Not very interesting though, what are we actually inheriting? The members of the parent class. + + +```tablegen +%reset +class C { + int a; +} +class D : C {} +``` + + ------------- Classes ----------------- + class C { + int a = ?; + } + class D { // C + int a = ?; + } + ------------- Defs ----------------- + + +Note that `D` now has the `a` member which was defined in the class `C`. + +You can inherit from multiple classes. In that case the order of that matches the order you write the class names after the `:`. + + +```tablegen +%reset +class C { + int a = 1; +} +class D { + int a = 2; +} +class E : C, D {} +``` + + ------------- Classes ----------------- + class C { + int a = 1; + } + class D { + int a = 2; + } + class E { // C D + int a = 2; + } + ------------- Defs ----------------- + + +Class `E` first inherits from class `C`. This gives `E` a member `a` with value `1`. Then it inherits from class `D` which also has a member `a` but with a value of `2`. Meaning the final value of `E`'s `a` is `2`. + +When a member has the same name this is handled on a "last one in wins" basis. Assuming the types match. + + +```tablegen +%reset +class C { + string a = ""; +} +class D { + int a = 2; +} +class E : C, D {} +``` + + :7:14: error: New definition of 'a' of type 'int' is incompatible with previous definition of type 'string' + class E : C, D {} + ^ + + +When they don't match, we get an error. Luckily for us, we're about to learn all about types. + +## Types + +TableGen is statically typed with error checking if you try to assign things with mismatched types. + + +```tablegen +%reset +class C { + int a; + bit b = 0; + string s = "Hello"; +} +``` + + ------------- Classes ----------------- + class C { + int a = ?; + bit b = 0; + string s = "Hello"; + } + ------------- Defs ----------------- + + +Here we've created a class C with integer, bit (1 or 0) and string members. See https://llvm.org/docs/TableGen/ProgRef.html#types for a full list of types. + +Note that you do not have to give a member a default value, it can be left uninitialised. + + +```tablegen +def X: C {} +``` + + ------------- Classes ----------------- + class C { + int a = ?; + bit b = 0; + string s = "Hello"; + } + ------------- Defs ----------------- + def X { // C + int a = ?; + bit b = 0; + string s = "Hello"; + } + + +When you make an instance of a class using `def`, that instance gets all the members of the class. Their values will be as set in the class, unless otherwise overridden. + +In the case of `a` it also keeps the undefined value. Any backend using that definition would have to check for that case. + + +```tablegen +def Y { + int a = "abc" +} +``` + + :8:13: error: Field 'a' of type 'int' is incompatible with value '"abc"' of type 'string' + int a = "abc" + ^ + :9:1: error: expected ';' after declaration + } + ^ + + +Here we see the type checking in action. Member `a` has type `int` so we cannot assign a `string` to it. + +## Let + +If we want to override those member values we can use `let` (https://llvm.org/docs/TableGen/ProgRef.html#let-override-fields-in-classes-or-records). This can be done in a couple of ways. The first is where you mark the scope of the `let` using `in {}`. + +`let = in {` + +The code below says that within the `{}` after the `let`, all `a` should have the value 5. + + +```tablegen +%reset +class C { + int a = 9; +} +let a=5 in { + def X: C {} +} +``` + + ------------- Classes ----------------- + class C { + int a = 9; + } + ------------- Defs ----------------- + def X { // C + int a = 5; + } + + +For multiple names, separate them with a comma. + + +```tablegen +%reset +class C { + int a; + int b; +} +let a=5, b=6 in { + def X: C {} +} +``` + + ------------- Classes ----------------- + class C { + int a = ?; + int b = ?; + } + ------------- Defs ----------------- + def X { // C + int a = 5; + int b = 6; + } + + +You can also use `let` within a `def`. This means the scope of the `let` is the same as the scope of the `def` (its `{...}`). + + +```tablegen +%reset +class C { + int a = 9; +} +def X: C { + let a=5; +} +def Y: C {} +``` + + ------------- Classes ----------------- + class C { + int a = 9; + } + ------------- Defs ----------------- + def X { // C + int a = 5; + } + def Y { // C + int a = 9; + } + + +Note that `Y` has `a` as `9` because the `let` was only applied to `X`. + +It is an error to try to `let` a name that hasn't been defined or to give it a value of the incorrect type. + + +```tablegen +%reset +class C { + int a = 9; +} +def X: C { + let a="Hello"; +} +``` + + :5:9: error: Field 'a' of type 'int' is incompatible with value '"Hello"' of type 'string' + let a="Hello"; + ^ + + +Above, the member `a` was defined but with a type of `int`. We therefore cannot `let` it have a value of type `string`. + + +```tablegen +%reset +class C { + int a = 9; +} +def X: C { + let b=5; +} +``` + + :5:11: error: Value 'b' unknown! + let b=5; + ^ + + +Above, class `C` only has one member, `a`. Therefore we get an error trying to override the value of `b` which doesn't exist. + +If you have multiple let, the outer scope is applied first then on down to the narrowest scope. + + +```tablegen +%reset +class C { + int a=9; +} +let a=5 in { + let a=6 in { + def X: C {} + } + def Y: C {} + def Z: C { let a=3; } +} +``` + + ------------- Classes ----------------- + class C { + int a = 9; + } + ------------- Defs ----------------- + def X { // C + int a = 6; + } + def Y { // C + int a = 5; + } + def Z { // C + int a = 3; + } + + +The first let is at what we call the "top level". That means the outer most scope in terms of the source code. A bit like a global variable in a C file. + +That is applied first and changes `a` from `9` to `5`. `X` is within a further let that changes `a` from `5` to `6`. `Y` however is outside that `let` so its `a` remains `5`. + +`Z` takes it a step further with a `let` inside the def. This overrides all the others to make `a` be `3` within `Z`. + +That example is quite complex just to demonstrate the feature. Let's look at something more practical. + +The use case here is that we are describing registers. The majority of which are 32 bits wide but there are some which are 64 bits wide. + + +```tablegen +%reset +class Register { + int size=4; +} +let size=8 in { + def X0: Register {} + // Repeats 31 times... +} +def W0: Register {} +// Repeats 31 times... +``` + + ------------- Classes ----------------- + class Register { + int size = 4; + } + ------------- Defs ----------------- + def W0 { // Register + int size = 4; + } + def X0 { // Register + int size = 8; + } + + +(for anyone curious that's AArch64's register naming) + +We start by setting a default value of `size` which is 4 (4x8=32 bits) in the class `Register`. Then using a top level `let` we override that value and set it to 8 for all the 64 bit registers. + +## Classes As Class Members + +In addition to the built in types, class members can be user defined classes. + + +```tablegen +%reset +class Inner {} +class Outer { + Inner i; +} +``` + + ------------- Classes ----------------- + class Inner { + } + class Outer { + Inner i = ?; + } + ------------- Defs ----------------- + + +Of course that raises the question, how do we construct an instance of `Inner` to use as the value? + +We simply use a `def` like we have done before. + + +```tablegen +%reset +class Inner {} +def AnInner: Inner {} +class Outer { + Inner i = AnInner; +} +def AnOuter: Outer {} +``` + + ------------- Classes ----------------- + class Inner { + } + class Outer { + Inner i = AnInner; + } + ------------- Defs ----------------- + def AnInner { // Inner + } + def AnOuter { // Outer + Inner i = AnInner; + } + + +## Class Template Arguments + +Class template arguments are used to pass parameters to classes when you define them. + + +```tablegen +%reset +class C { + int c = a; + int d = b; +} +def X: C<0, 1> {} +``` + + ------------- Classes ----------------- + class C { + int c = C:a; + int d = C:b; + } + ------------- Defs ----------------- + def X { // C + int c = 0; + int d = 1; + } + + +This means that to `def` a `C` we must now provide 2 arguments that have type `int` (type checking applies here as it does for class members). + +This is going to look familiar if you have written C++. In C++ it might look like: +``` +template +class C { + int c = a; + int d = b; +}; +C<0, 1> X; +``` + +If templates aren't your thing, another way to think of them is as parameters to the constructor of a class. + +For instance Python code might look like this: +``` +class C(object): + def __init__(self, a, b): + self.c = a + self.d = b + +print(C(1, 2).c) +# prints "1" +``` + + +```tablegen +%reset +class C { + int c = a; + int d = b; +} +def X: C<0> {} +``` + + :5:8: error: Value not specified for template argument 'C:b' (#1) of parent class 'C' + def X: C<0> {} + ^ + + +When not enough arguments are provided, you get an error. + +Below is what happens when one of those arguments is of the wrong type. + + +```tablegen +%reset +class C { + int c = a; + int d = b; +} +def X: C<0, "hello"> {} +``` + + :5:8: error: Value specified for template argument 'C:b' (#1) is of type string; expected type int: "hello" + def X: C<0, "hello"> {} + ^ + + +Using class template arguments you can enforce a structure on the user of the classes. + +In our previous register example I could use this to require the the user pass a value for the size. + +Classes also don't have to have all their members derived from template arguments. You can mix and match allowing some members to have default values (essentially, optional arguments to the class constructor). + +Back to the registers example. The code below makes the size argument mandatory but the alias optional. + + +```tablegen +%reset +class Register { + int size = _size; + string alias = ""; +} +def X0: Register<8> {} +def X29: Register<8> { + let alias="frame pointer"; +} +``` + + ------------- Classes ----------------- + class Register { + int size = Register:_size; + string alias = ""; + } + ------------- Defs ----------------- + def X0 { // Register + int size = 8; + string alias = ""; + } + def X29 { // Register + int size = 8; + string alias = "frame pointer"; + } + + +For `X0` we don't define an alias. Meaning we get the default of `""` (which we'll define to mean that there's +no alias). + +For `X29` we've passed the required size as a class template argument and then added a `let` to the definition's +body to override and set an alias. + +In C++, the equivalent would be: +``` +// Constructor for class Register +Register(int size, const char* alias=nullptr) : +``` + +**Note:** Unlike a c++ constructor you can't reuse the name between the template argument and the class member. +Here I have added `_` to the template argument but there's no required style. + +Or Python: +``` +def __init__(self, size, alias=""): +``` + +This concludes part 1 of this tutorial. To learn more advanced features, see the following parts which are in this same folder.