This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
test/TableGen/
-
TableGen/
-
intrin-side-effects.td
-
utils/TableGen/
-
TableGen/
1
IntrinsicEmitter.cpp

Differential D76127

[TableGen] Do not set ReadOnly attribute on intrinsics with side effects
Needs RevisionPublic

Authored by TOCK on Mar 13 2020, 5:26 AM.

Download Raw Diff

Details

Reviewers

stoklund
jdoerfert
craig.topper
arsenm

Summary

If an intrinsic with side effects is called twice using the same set of
arguments, EarlyCSE would happily remove the second call because it has
ReadOnly attribute. Similar to 52c39396151978ca946e2a80d9118c8672bace14,
this patch makes TableGen not set Attribute::ReadOnly for intrinsics
which are declared with IntrHasSideEffects.

Diff Detail

Event Timeline

TOCK created this revision.Mar 13 2020, 5:26 AM

TOCK created this object with visibility "All Users".

Herald added a project: Restricted Project. · View Herald TranscriptMar 13 2020, 5:26 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B49126: Diff 250182.Mar 13 2020, 6:22 AM

For example, say we have these intrinsics:

def int_test_no_mem_v : Intrinsic<[], [llvm_i64_ty], [IntrNoMem, IntrHasSideEffects]>;
def int_test_read_mem_v : Intrinsic<[], [llvm_i64_ty], [IntrReadMem, IntrHasSideEffects]>;
def int_test_no_mem : Intrinsic<[llvm_i64_ty], [llvm_i64_ty], [IntrNoMem, IntrHasSideEffects]>;
def int_test_read_mem : Intrinsic<[llvm_i64_ty], [llvm_i64_ty], [IntrReadMem, IntrHasSideEffects]>;

Following code:

define i64 @foo(i64 %a, i64 %b) {
entry:
  call void @llvm.test.no.mem.v(i64 %a)
  call void @llvm.test.no.mem.v(i64 %a)
  call void @llvm.test.read.mem.v(i64 %b)
  call void @llvm.test.read.mem.v(i64 %b)
  %r1 = call i64 @llvm.test.no.mem(i64 %a)
  %r3 = call i64 @llvm.test.no.mem(i64 %a)
  %r2 = call i64 @llvm.test.read.mem(i64 %b)
  %r4 = call i64 @llvm.test.read.mem(i64 %b)
  %r12 = add nsw i64 %r1, %r2
  %r34 = add nsw i64 %r3, %r4
  %r14 = add nsw i64 %r12, %r34
  ret i64 %r14
}
declare void @llvm.test.no.mem.v(i64)
declare void @llvm.test.read.mem.v(i64)
declare i64 @llvm.test.no.mem(i64)
declare i64 @llvm.test.read.mem(i64)

would be optimized into:

define i64 @foo(i64 %a, i64 %b) local_unnamed_addr #0 {
entry:
  tail call void @llvm.test.no.mem.v(i64 %a)
  tail call void @llvm.test.no.mem.v(i64 %a)
  %r1 = tail call i64 @llvm.test.no.mem(i64 %a)
  %r3 = tail call i64 @llvm.test.no.mem(i64 %a)
  %r2 = tail call i64 @llvm.test.read.mem(i64 %b)
  %factor = shl i64 %r2, 1
  %r12 = add i64 %r3, %r1
  %r14 = add i64 %r12, %factor
  ret i64 %r14
}
...

Calls to @llvm.test.read.mem.v are removed completely (this is what D64414 tried to address), also one call to @llvm.test.read.mem is removed:

EarlyCSE DCE:   call void @llvm.test.read.mem.v(i64 %b)                                                                                           
EarlyCSE DCE:   call void @llvm.test.read.mem.v(i64 %b)                                                                                           
EarlyCSE CSE CALL:   %r4 = call i64 @llvm.test.read.mem(i64 %b)  to:   %r2 = call i64 @llvm.test.read.mem(i64 %b)                                 
        discovered a new reachable node %entry

This is due to the fact that TableGen doesn't respect IntrHasSideEffects, and make it read only:

...
    case 52: {
      const Attribute::AttrKind Atts[] = {Attribute::NoUnwind,Attribute::ReadOnly};
      AS[0] = AttributeList::get(C, AttributeList::FunctionIndex, Atts);
      NumAttrs = 1;
      break;
      }
...

TOCK added a reviewer: arsenm.Sep 28 2020, 3:38 AM

TOCK changed the visibility from "All Users" to "Public (No Login Required)".

Herald added a subscriber: wdng. · View Herald TranscriptSep 28 2020, 3:38 AM

The diffs lack context.

It looks like there is some inconsistency between .td attributes/properties and IR ones.
As far as I can tell there is no way in IR to mark something as "this doesn't touch mem but may have other side-effects" so it'd be safe to, for example, move some load past that instruction but not to DCE/CSE it.

Hi @danilaml,

The diffs lack context.

This patch is some kind of follow-up of D64414, which make TableGen stop emitting Attribute::ReadNone for IntrNoMem if such intrinsic is marked as IntrHasSideEffects. This patch does the same for other cases like IntrReadMem.

As far as I can tell there is no way in IR to mark something as "this doesn't touch mem but may have other side-effects" so it'd be safe to, for example, move some load past that instruction but not to DCE/CSE it.

As far I can tell most targets simply mark it with side-effects and rely on the backend to do optimization, since it might be able to model the side-effects and have a clearer view of what is really accessed.

@TOCK
I've meant the surrounding context for the diff (https://llvm.org/docs/Phabricator.html#phabricator-request-review-web),
i.e. git show HEAD -U999999

As far I can tell most targets simply mark it with side-effects and rely on the backend to do optimization, since it might be able to model the side-effects and have a clearer view of what is really accessed.

The problem is that is that it looks like the info is lost.
It is also unclear what is the effect of IntrNoMem/IntrReadMem, IntrHasSideEffects now. From documentation/patch it seems the original intent was to mark intrinsics that don't modify/use memory (so optimizations that rely on mayWriteMemory will work for them), but have some non memory related side-effects (so the wouldn't be CSE's/DCE'd and etc.).
But after this and D64414 patches such intrinsics would return mayWriteMemory, which feels counter-intuitive and against the initial intent. I don't know how to properly fix this (and at what level this problem lies).

danilaml added a reviewer: jdoerfert.Nov 27 2020, 3:14 AM

Which LLVM IR intrinsics are affected by this?

Include context as @danilaml suggested.

@danilaml
+1. It looks like a workaround at the moment.

@lebedev.ri
I'm not aware of any in-tree intrinsic affected. The affected one I found is generated by some testing tool.

In D76127#2421901, @TOCK wrote:

@danilaml
+1. It looks like a workaround at the moment.

@lebedev.ri
I'm not aware of any in-tree intrinsic affected. The affected one I found is generated by some testing tool.

Probably no in-tree intrinsic is affected because the current code causes SelectionDAGBuilder to generate broken code. I believe this is the FIXME on int_x86_sse_ldmxcsr in llvm/include/IR/IntrinsicsX86.td

Having just tripped over this bug again. I think we should fix this

LGTM

This revision is now accepted and ready to land.Jan 20 2021, 11:27 AM

I'd like for @jdoerfert to comment.
Can we simply disallow this somewhere and catch such a combination in a verifier or something?

TBH, I feel "X is readonly and has side effects" sends the wrong message to begin with. It is a contradiction (in the IR world) as basically shown by the need for this patch. Given that there are no examples in-tree I don't understand why one would mark a side-effect intrinsic as readonly (or similar). Long story short, I would argue this should be a loud error, not silently ignored.

In D76127#2510504, @jdoerfert wrote:

TBH, I feel "X is readonly and has side effects" sends the wrong message to begin with. It is a contradiction (in the IR world) as basically shown by the need for this patch. Given that there are no examples in-tree I don't understand why one would mark a side-effect intrinsic as readonly (or similar). Long story short, I would argue this should be a loud error, not silently ignored.

Why shouldn't the intrinsics file be able to express the semantics we want even if we can't represent it in IR today?

The annoying issue is that tablegen also uses the WriteMem to force the mayStore flag on any machine IR instruction that has a pattern that references the intrinsic. Machine IR has a separate side effects flag. So we want IntrReadMem+IntrHasSideEffects to only set mayLoad+hasSideEffects in machine IR.

And there are no examples in tree because it breaks today. See the FIXME on ldmxcsr. It causes SelectionDAGBuilder to fail to connect the output chain to the root node in the DAG which makes the intrinsic eligible for deletion when it shouldn't be.

craig.topper added inline comments.Jan 20 2021, 12:03 PM

llvm/utils/TableGen/IntrinsicEmitter.cpp
836	Thinking about this again, I think the entire switch should be skipped if hasSideEffects is set. None of the cases are valid. Which is implied by the if statement earlier checking (intrinsic.ModRef != CodeGenIntrinsic::ReadWriteMem && !intrinsic.hasSideEffects)

In D76127#2510562, @craig.topper wrote:

In D76127#2510504, @jdoerfert wrote:

TBH, I feel "X is readonly and has side effects" sends the wrong message to begin with. It is a contradiction (in the IR world) as basically shown by the need for this patch. Given that there are no examples in-tree I don't understand why one would mark a side-effect intrinsic as readonly (or similar). Long story short, I would argue this should be a loud error, not silently ignored.

Why shouldn't the intrinsics file be able to express the semantics we want even if we can't represent it in IR today?

My position hasn't changed much from before: https://reviews.llvm.org/D64414#1584449

I mean, the way we interpret the bits for the IR is contradictory, isn't it? In IR, you cannot have an intrinsic that is readonly and has side-effect. Why would we want to express that in the intrinsics file if it is impossible and probably a sign of a misconfiguration. Maybe I misunderstand what "side-effects" are supposed to be here but I find it confusing to say "readonly with side-effects". Interestingly, this "handling" of such a configuration disables middle-end optimizations to enable backend optimizations, which may or may not be worth it.

FWIW, I believe adding more categories other than "inaccessible memory" and "argument memory" is the right way to resolve this issue. Also, side-effect free instructions that are not willreturn should not be deleted in the future. This might not be interesting for the ldmxcsr case (which I don't know what the side-effect is), but for D64414 this might be the proper
way to model it.

Revoking my approval

This revision now requires changes to proceed.Jan 20 2021, 3:42 PM

I also don't think readonly makes sense with side effects

Herald added a project: Restricted Project. · View Herald TranscriptSep 28 2022, 2:02 PM

Herald added a subscriber: StephenFan. · View Herald Transcript

Revision Contents

Path

Size

llvm/

test/

TableGen/

intrin-side-effects.td

50 lines

utils/

TableGen/

IntrinsicEmitter.cpp

8 lines

Diff 308253

llvm/test/TableGen/intrin-side-effects.td

	// RUN: llvm-tblgen -gen-intrinsic-impl -I %p/../../include %s \| FileCheck %s			// RUN: llvm-tblgen -gen-intrinsic-impl -I %p/../../include %s \| FileCheck %s

	// Get the minimum blurb necessary to process ...			// Get the minimum blurb necessary to process ...
	include "llvm/CodeGen/ValueTypes.td"			include "llvm/CodeGen/ValueTypes.td"
	include "llvm/CodeGen/SDNodeProperties.td"			include "llvm/CodeGen/SDNodeProperties.td"

	class LLVMType<ValueType vt> {			class LLVMType<ValueType vt> {
	ValueType VT = vt;			ValueType VT = vt;
	int isAny = 0;			int isAny = 0;
	}			}

				class LLVMQualPointerType<LLVMType elty, int addrspace>
				: LLVMType<iPTR>{
				LLVMType ElTy = elty;
				int AddrSpace = addrspace;
				}

				class LLVMPointerType<LLVMType elty>
				: LLVMQualPointerType<elty, 0>;

				def llvm_i8_ty : LLVMType<i8>;
	def llvm_i32_ty : LLVMType<i32>;			def llvm_i32_ty : LLVMType<i32>;
				def llvm_ptr_ty : LLVMPointerType<llvm_i8_ty>;

	class IntrinsicProperty<bit is_default = 0> {			class IntrinsicProperty<bit is_default = 0> {
	bit IsDefault = is_default;			bit IsDefault = is_default;
	}			}

	def IntrNoMem : IntrinsicProperty;			def IntrNoMem : IntrinsicProperty;
	def IntrHasSideEffects : IntrinsicProperty;			def IntrHasSideEffects : IntrinsicProperty;
				def IntrReadMem : IntrinsicProperty;
				def IntrArgMemOnly : IntrinsicProperty;
				def IntrInaccessibleMemOnly : IntrinsicProperty;
				def IntrInaccessibleMemOrArgMemOnly : IntrinsicProperty;

	class Intrinsic<list<LLVMType> ret_types,			class Intrinsic<list<LLVMType> ret_types,
	list<LLVMType> param_types = [],			list<LLVMType> param_types = [],
	list<IntrinsicProperty> intr_properties = [],			list<IntrinsicProperty> intr_properties = [],
	string name = "",			string name = "",
	list<SDNodeProperty> sd_properties = [],			list<SDNodeProperty> sd_properties = [],
	bit disable_default_attributes = 0> : SDPatternOperator {			bit disable_default_attributes = 0> : SDPatternOperator {
	string LLVMName = name;			string LLVMName = name;
	string TargetPrefix = "";			string TargetPrefix = "";
	list<LLVMType> RetTypes = ret_types;			list<LLVMType> RetTypes = ret_types;
	list<LLVMType> ParamTypes = param_types;			list<LLVMType> ParamTypes = param_types;
	list<IntrinsicProperty> IntrProperties = intr_properties;			list<IntrinsicProperty> IntrProperties = intr_properties;
	let Properties = sd_properties;			let Properties = sd_properties;
	bit DisableDefaultAttributes = 1;			bit DisableDefaultAttributes = 1;


	bit isTarget = 0;			bit isTarget = 0;
	bit DisableDefaultAttributes = disable_default_attributes;			bit DisableDefaultAttributes = disable_default_attributes;
	}			}

	// ... this intrinsic.			// ... these intrinsic.
	def int_random_gen : Intrinsic<[llvm_i32_ty], [], [IntrNoMem, IntrHasSideEffects]>;
				// CHECK: 1, // llvm.random.gen.no.mem
				// CHECK: 2, // llvm.random.gen.read.arg.mem
				// CHECK: 3, // llvm.random.gen.read.inaccessible.mem
				// CHECK: 4, // llvm.random.gen.read.inaccessible.mem.or.arg.mem
				// CHECK: 5, // llvm.random.gen.read.mem

				// CHECK: case 5:
				// CHECK-NEXT: Atts[] = {Attribute::NoUnwind}
				def int_random_gen_read_mem
				: Intrinsic<[llvm_i32_ty], [], [IntrReadMem, IntrHasSideEffects]>;

				// CHECK: case 4:
				// CHECK-NEXT: Atts[] = {Attribute::NoUnwind}
				def int_random_gen_read_inaccessible_mem_or_arg_mem
				: Intrinsic<[llvm_i32_ty], [], [IntrReadMem,IntrInaccessibleMemOrArgMemOnly,
				IntrHasSideEffects]>;

				// CHECK: case 2:
				// CHECK-NEXT: Atts[] = {Attribute::NoUnwind}
				def int_random_gen_read_arg_mem
				: Intrinsic<[llvm_i32_ty], [llvm_ptr_ty], [IntrReadMem, IntrArgMemOnly,
				IntrHasSideEffects]>;

				// CHECK: case 3:
				// CHECK-NEXT: Atts[] = {Attribute::NoUnwind}
				def int_random_gen_read_inaccessible_mem
				: Intrinsic<[llvm_i32_ty], [], [IntrReadMem, IntrInaccessibleMemOnly,
				IntrHasSideEffects]>;

	// CHECK: 1, // llvm.random.gen
	// CHECK: case 1:			// CHECK: case 1:
	// CHECK-NEXT: Atts[] = {Attribute::NoUnwind}			// CHECK-NEXT: Atts[] = {Attribute::NoUnwind}
				def int_random_gen_no_mem
				: Intrinsic<[llvm_i32_ty], [], [IntrNoMem, IntrHasSideEffects]>;

llvm/utils/TableGen/IntrinsicEmitter.cpp

Show First 20 Lines • Show All 827 Lines • ▼ Show 20 Lines	if (!intrinsic.canThrow \|\|
case CodeGenIntrinsic::NoMem:		case CodeGenIntrinsic::NoMem:
if (intrinsic.hasSideEffects)		if (intrinsic.hasSideEffects)
break;		break;
if (addComma)		if (addComma)
OS << ",";		OS << ",";
OS << "Attribute::ReadNone";		OS << "Attribute::ReadNone";
break;		break;
case CodeGenIntrinsic::ReadArgMem:		case CodeGenIntrinsic::ReadArgMem:
		if (intrinsic.hasSideEffects)
		craig.topperUnsubmitted Not Done Reply Inline Actions Thinking about this again, I think the entire switch should be skipped if hasSideEffects is set. None of the cases are valid. Which is implied by the if statement earlier checking (intrinsic.ModRef != CodeGenIntrinsic::ReadWriteMem && !intrinsic.hasSideEffects) craig.topper: Thinking about this again, I think the entire switch should be skipped if hasSideEffects is set.
		break;
if (addComma)		if (addComma)
OS << ",";		OS << ",";
OS << "Attribute::ReadOnly,";		OS << "Attribute::ReadOnly,";
OS << "Attribute::ArgMemOnly";		OS << "Attribute::ArgMemOnly";
break;		break;
case CodeGenIntrinsic::ReadMem:		case CodeGenIntrinsic::ReadMem:
		if (intrinsic.hasSideEffects)
		break;
if (addComma)		if (addComma)
OS << ",";		OS << ",";
OS << "Attribute::ReadOnly";		OS << "Attribute::ReadOnly";
break;		break;
case CodeGenIntrinsic::ReadInaccessibleMem:		case CodeGenIntrinsic::ReadInaccessibleMem:
		if (intrinsic.hasSideEffects)
		break;
if (addComma)		if (addComma)
OS << ",";		OS << ",";
OS << "Attribute::ReadOnly,";		OS << "Attribute::ReadOnly,";
OS << "Attribute::InaccessibleMemOnly";		OS << "Attribute::InaccessibleMemOnly";
break;		break;
case CodeGenIntrinsic::ReadInaccessibleMemOrArgMem:		case CodeGenIntrinsic::ReadInaccessibleMemOrArgMem:
		if (intrinsic.hasSideEffects)
		break;
if (addComma)		if (addComma)
OS << ",";		OS << ",";
OS << "Attribute::ReadOnly,";		OS << "Attribute::ReadOnly,";
OS << "Attribute::InaccessibleMemOrArgMemOnly";		OS << "Attribute::InaccessibleMemOrArgMemOnly";
break;		break;
case CodeGenIntrinsic::WriteArgMem:		case CodeGenIntrinsic::WriteArgMem:
if (addComma)		if (addComma)
OS << ",";		OS << ",";
▲ Show 20 Lines • Show All 153 Lines • Show Last 20 Lines