This is an archive of the discontinued LLVM Phabricator instance.

Differential D101185

[LangRef] tbaa: type names can be used as hints to optimizations
Abandoned · Public

Authored by aqjune on Apr 23 2021, 10:33 AM.

Details

Reviewers

jeroen.dobbelaere
jdoerfert
lebedev.ri
nikic
nlopes

Summary

As discussed in D100717, this patch states that if a TBAA metadata node's type name is, e.g.,
"any pointer" or "vtable pointer", it can be used as a hint to drive further optimizations/passes.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
2,170 ms	x64 debian > libarcher.races::lock-unrelated.c

Event Timeline

aqjune created this revision. Apr 23 2021, 10:33 AM

Herald added a subscriber: kosarev. Apr 23 2021, 10:33 AM

aqjune requested review of this revision. Apr 23 2021, 10:33 AM

Herald added a project: Restricted Project. Apr 23 2021, 10:33 AM

Herald added a subscriber: llvm-commits.

aqjune added reviewers: lebedev.ri, nikic, nlopes. Apr 23 2021, 10:39 AM

aqjune added inline comments. Apr 23 2021, 10:41 AM

llvm/docs/LangRef.rst
5794	This sentence ('LLVM does not assign ... an `MDString`') became a separate paragraph (line 5807) and the explanation is added. Similarly for the analogous sentence in the next paragraph as well.

jdoerfert added inline comments. Apr 23 2021, 10:42 AM

llvm/docs/LangRef.rst
5807–5810	I think we should make it open ended right away, and stress that it is for heuristics, not correctness.

Address jdoerfert's comment

As suggested in D100717, I'll send a mail to llvm-dev for visibility.

aqjune retitled this revision from "[LangRef] tbaa: 'any pointer' and 'vtable pointer' type names can be used" to "[LangRef] tbaa: type names can be used as hints to optimizations". Apr 23 2021, 11:07 AM

aqjune edited the summary of this revision.

I am somewhat confused on performance vs correctness - D100717 refers to a miscompile, would adding this behavior also clear miscompiles?

Harbormaster completed remote builds in B100625: Diff 340106. Apr 23 2021, 1:23 PM

Harbormaster completed remote builds in B100628: Diff 340110. Apr 23 2021, 1:39 PM

+1 from me.

In D101185#2713286, @penzn wrote:

I am somewhat confused on performance vs correctness - D100717 refers to a miscompile, would adding this behavior also clear miscompiles?

Having this will allow instcombine to produce fewer of the patterns that the miscompiling fold folds away,
thus moving towards potentially being able to remove the miscompiling fold one day.
So no, not really: this in itself won't do anything about those miscompiles.

If no one objects by Friday (in the next 5 days), I think we can proceed here?

This revision is now accepted and ready to land. Apr 26 2021, 6:44 AM

jdoerfert accepted this revision. Apr 26 2021, 7:58 AM

On a high level, encoding more Clang specifics into the tbaa spec and using it as heuristics seems a bit unfortunate to me, as it may pessimize frontends that for various reasons cannot use tbaa, especially because the additional information does not seem tbaa specific to me.

I understand it is very convenient to use tbaa in this case, but I am worried about non-Clang frontends once this heuristic becomes important for performance. What will we suggest to frontends that want to opt-in to the optimizations (but tbaa in general is not suitable for them)?

llvm/docs/LangRef.rst
5811	I think from that formulation it is still not clear what kind of meaning is assigned to those special strings. Unless I am missing something, it is still not clear where and how frontends should use/emit those special names. I think it would be good if the description would be clear on how new frontends should use the special names.

In D101185#2716977, @fhahn wrote:

On a high level, encoding more Clang specifics into the tbaa spec and using it as heuristics seems a bit unfortunate to me, as it may pessimize frontends that for various reasons cannot use tbaa, especially because the additional information does not seem tbaa specific to me.

I understand it is very convenient to use tbaa in this case, but I am worried about non-Clang frontends once this heuristic becomes important for performance. What will we suggest to frontends that want to opt-in to the optimizations (but tbaa in general is not suitable for them)?

We could introduce something like !tb.struct, which points to structured information similar to that of !tbaa.struct. When that is present, the information could also be used to expand llvm.memcpy, but without enforcing the type-based aliasing implications.

In D101185#2716977, @fhahn wrote:

On a high level, encoding more Clang specifics into the tbaa spec and using it as heuristics seems a bit unfortunate to me, as it may pessimize frontends that for various reasons cannot use tbaa, especially because the additional information does not seem tbaa specific to me.

I understand it is very convenient to use tbaa in this case, but I am worried about non-Clang frontends once this heuristic becomes important for performance. What will we suggest to frontends that want to opt-in to the optimizations (but tbaa in general is not suitable for them)?

This is also my concern. It would be very helpful if somebody familiar with TBAA could clarify whether there is any way to add the necessary metadata without having an effect on aliasing. I don't think it's possible to just disable the TBAA analysis, because the optimization pipeline is generally not under your control (in cross-language linker-plugin LTO scenarios).

From my reading of LangRef, it might be possible to have a type hierarchy of "Root" <- "Dummy" <- "any pointer", where "any pointer" is used to annotate pointer types and "Dummy" for anything else. Then aliasing checks between "Dummy" and "any pointer" will always report aliasing, as it's reachable in one direction. More generally, as long as the type "hierarchy" is a linear chain, no useful aliasing information can be derived. Is that correct? Would this work in practice without causing other complications?
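The linear-chain argument can be checked with a small model. This is only a sketch, assuming the simplified scalar rule from LangRef (two scalar accesses may alias when one type is reachable from the other via the parent relation); the dictionaries, the sibling type name "int", and the helper names are illustrative and are not LLVM APIs.

```python
# Simplified model of scalar TBAA: each type names its parent; the root has none.
# The chain below mirrors "Root" <- "Dummy" <- "any pointer" from the comment.
PARENT = {
    "Root": None,
    "Dummy": "Root",
    "any pointer": "Dummy",
}


def ancestors(ty, parent=PARENT):
    """Yield ty and every type reachable from it via the parent relation."""
    while ty is not None:
        yield ty
        ty = parent[ty]


def may_alias(a, b, parent=PARENT):
    """Two scalar accesses may alias if either type is reachable from the other."""
    return a in ancestors(b, parent) or b in ancestors(a, parent)


# Linear chain: every pair is ordered by reachability, so nothing is disproved.
chain_result = may_alias("Dummy", "any pointer")

# Contrast (hypothetical tree): siblings under one root are unrelated,
# so this model reports that they do not alias.
TREE = {"Root": None, "int": "Root", "any pointer": "Root"}
tree_result = may_alias("int", "any pointer", TREE)
```

In this model `chain_result` is true and `tree_result` is false, which is the difference between a chain that carries the names without aliasing consequences and a tree that does restrict aliasing.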

I do think what @jeroen.dobbelaere suggests would be the right way to approach this. This doesn't even have to be in addition to !tbaa.struct, but can be a replacement for it. In particular, I'm thinking that instead of having an {offset, size, tbaa} encoding, it could be {offset, size, type, tbaa}, where type is some well-defined type indicator to use for this optimization, while tbaa is an optional TBAA reference for the member. Frontends with type-based aliasing models would populate the last element, frontends without it wouldn't.
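To make the proposed {offset, size, type, tbaa} shape concrete, here is a minimal sketch. It assumes the type indicator is a plain string such as "pointer" or "integer"; the thread does not define the indicator's values, so these names, the `FieldInfo` class, and `pointer_ranges` are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FieldInfo:
    offset: int                 # byte offset of the member
    size: int                   # byte size of the member
    type: str                   # hypothetical type indicator, e.g. "pointer"
    tbaa: Optional[str] = None  # optional TBAA reference; None if the frontend has none


def pointer_ranges(fields):
    """Return (offset, size) for members marked as pointers, ignoring TBAA."""
    return [(f.offset, f.size) for f in fields if f.type == "pointer"]


# A frontend with a type-based aliasing model populates the last element...
with_tbaa = [
    FieldInfo(0, 4, "integer", "int"),
    FieldInfo(8, 8, "pointer", "any pointer"),
]
# ...and a frontend without one leaves it empty.
without_tbaa = [
    FieldInfo(0, 4, "integer"),
    FieldInfo(8, 8, "pointer"),
]
```

Both lists yield the same result from `pointer_ranges`, which is the point of the proposal: the optimization reads the type element and never depends on the TBAA element being present.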

In D101185#2731194, @nikic wrote:

From my reading of LangRef, it might be possible to have a type hierarchy of "Root" <- "Dummy" <- "any pointer", where "any pointer" is used to annotate pointer types and "Dummy" for anything else. Then aliasing checks between "Dummy" and "any pointer" will always report aliasing, as it's reachable in one direction. More generally, as long as the type "hierarchy" is a linear chain, no useful aliasing information can be derived. Is that correct? Would this work in practice without causing other complications?

That will indeed work today. One thing I find annoying about it is that it feels like 'fighting to not use tbaa', but it is valid and it will have the intended effect.

I do think what @jeroen.dobbelaere suggests would be the right way to approach this. This doesn't even have to be in addition to !tbaa.struct, but can be a replacement for it. In particular, I'm thinking that instead of having an {offset, size, tbaa} encoding, it could be {offset, size, type, tbaa}, where type is some well-defined type indicator to use for this optimization, while tbaa is an optional TBAA reference for the member. Frontends with type-based aliasing models would populate the last element, frontends without it wouldn't.

Something like that would also work.

FWIW, in the AA call we concluded that we should move away from "tbaa" metadata towards "type" metadata that is reusable. "tbaa" could be a subset, maybe identified with a flag, which will allow TBAA to use it.

Thank you for the input!
Nuno and I also discussed this and concluded that tbaa might not be a good fit for this purpose.
I will revisit this and D100717 when I have enough bandwidth.

Matt added a subscriber: Matt. May 6 2021, 7:31 AM

aqjune abandoned this revision. Jul 27 2021, 5:38 PM

Revision Contents

Path	Size
llvm/docs/LangRef.rst	29 lines

Diff 340110

llvm/docs/LangRef.rst


  Representation
  """"""""""""""

  The root node of a TBAA type hierarchy is an ``MDNode`` with 0 operands or
  with exactly one ``MDString`` operand.

  Scalar type descriptors are represented as an ``MDNode`` s with two
  operands. The first operand is an ``MDString`` denoting the name of the
- struct type. LLVM does not assign meaning to the value of this operand, it
- only cares about it being an ``MDString``. The second operand is an
- ``MDNode`` which points to the parent for said scalar type descriptor,
- which is either another scalar type descriptor or the TBAA root. Scalar
- type descriptors can have an optional third argument, but that must be the
- constant integer zero.
+ scalar type. The second operand is an ``MDNode`` which points to the parent
+ for said scalar type descriptor, which is either another scalar type
+ descriptor or the TBAA root. Scalar type descriptors can have an optional
+ third argument, but that must be the constant integer zero.

aqjune (Author) · Unsubmitted · Done

This sentence ('LLVM does not assign ... an `MDString`') became a separate paragraph (line 5807) and the explanation is added.
Similarly for the analogous sentence in the next paragraph as well.

  Struct type descriptors are represented as ``MDNode`` s with an odd number
  of operands greater than 1. The first operand is an ``MDString`` denoting
- the name of the struct type. Like in scalar type descriptors the actual
- value of this name operand is irrelevant to LLVM. After the name operand,
- the struct type descriptors have a sequence of alternating ``MDNode`` and
- ``ConstantInt`` operands. With N starting from 1, the 2N - 1 th operand,
- an ``MDNode``, denotes a contained field, and the 2N th operand, a
- ``ConstantInt``, is the offset of the said contained field. The offsets
- must be in non-decreasing order.
+ the name of the struct type. After the name operand, the struct type
+ descriptors have a sequence of alternating ``MDNode`` and ``ConstantInt``
+ operands. With N starting from 1, the 2N - 1 th operand, an ``MDNode``,
+ denotes a contained field, and the 2N th operand, a ``ConstantInt``, is the
+ offset of the said contained field. The offsets must be in non-decreasing
+ order.
+
+ The names in the first operand of a scalar or struct type description can be
+ chosen freely by the frontend. However, optimizations might use them as a
+ heuristics for better performance without affecting correctness. As an
+ example, ``"any pointer"`` and ``"vtable pointer"`` might be recognized and
+ used to select a different, but equivalent, code generation.

jdoerfert · Unsubmitted · Done

  order.

- LLVM does not assign meaning to the name of the type that is the value of the
- first operand of scalar type descriptors or struct type descriptors. But,
- ``"any pointer"`` and ``"vtable pointer"`` are exceptions and they might be
- used to drive optimization passes/code generation.
+ The names in the first operand of a scalar or struct type description can be
+ chosen freely, however, optimizations might use them for heuristics. That means
+ the value will not influence correctness but potentially performance. As an
+ example, ``"any pointer"`` and ``"vtable pointer"`` might be recognized and used
+ to select a different, but equivalent, code generation.

  Access tags are represented as ``MDNode`` s with either 3 or 4 operands.

I think we should make it open ended right away, and stress that it is for heuristics, not correctness.

fhahn · Unsubmitted · Not Done

I think from that formulation it is still not clear *what* kind of meaning is assigned to those special strings.

Unless I am missing something, it is still not clear *where* and *how* frontends should use/emit those special names. I think it would be good if the description would be clear on how new frontends should use the special names.

  Access tags are represented as ``MDNode`` s with either 3 or 4 operands.
  The first operand is an ``MDNode`` pointing to the node representing the
  base type. The second operand is an ``MDNode`` pointing to the node
  representing the access type. The third operand is a ``ConstantInt`` that
  states the offset of the access. If a fourth field is present, it must be
  a ``ConstantInt`` valued at 0 or 1. If it is 1 then the access tag states
  that the location being accessed is "constant" (meaning
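
The struct type descriptor layout quoted in the diff above (an odd number of operands greater than 1, a name first, then alternating field nodes and offsets in non-decreasing order) can be restated as a small check. This is a sketch only: it models an ``MDNode`` as a Python list, with strings standing in for ``MDString``, ints for ``ConstantInt``, and tuples for field ``MDNode`` s. It is not LLVM's verifier, and every name in it is illustrative.

```python
def is_valid_struct_descriptor(operands):
    """Check the operand layout described for TBAA struct type descriptors."""
    # An odd number of operands greater than 1.
    if len(operands) <= 1 or len(operands) % 2 == 0:
        return False
    # Operand 0 is the name; only its kind is checked, not its value.
    if not isinstance(operands[0], str):
        return False
    fields = operands[1::2]   # operands 2N - 1 for N = 1, 2, ...
    offsets = operands[2::2]  # operands 2N for N = 1, 2, ...
    if not all(isinstance(f, tuple) for f in fields):
        return False
    if not all(isinstance(o, int) for o in offsets):
        return False
    # Offsets must be in non-decreasing order.
    return all(a <= b for a, b in zip(offsets, offsets[1:]))


INT = ("int", "Root")              # stand-in for a scalar type descriptor node
ok = ["S", INT, 0, INT, 4]         # two fields at offsets 0 and 4
bad_order = ["S", INT, 4, INT, 0]  # offsets decrease
bad_count = ["S", INT]             # even operand count
```

The check never inspects the value of the name operand, which matches the text on both sides of the diff: the name may be chosen freely, and any use of it is a heuristic layered on top.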
