This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/IR/
-
llvm/
-
IR/
-
Attributes.h
-
lib/
-
IR/
-
Attributes.cpp
-
Transforms/InstCombine/
-
InstCombine/
-
InstCombineCalls.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
1/2
call-cast-attrs.ll

Differential D119967

[InstCombineCalls] Optimize call of bitcast even w/ parameter attributes
ClosedPublic

Authored by jdoerfert on Feb 16 2022, 12:38 PM.

Download Raw Diff

Details

Reviewers

aeubanks
lebedev.ri
nikic
reames
spatel
efriedma

Commits

rGbb0b23174e4a: [InstCombineCalls] Optimize call of bitcast even w/ parameter attributes

Summary

Before we gave up if a call through bitcast had parameter attributes.
Interestingly, we allowed attributes for the return value already. We
now handle both the same way, namely, we drop the ones that are
incompatible with the new type and keep the rest. This cannot cause
"more UB" than initially present.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jdoerfert created this revision.Feb 16 2022, 12:38 PM

Herald added subscribers: bollu, hiraditya. · View Herald TranscriptFeb 16 2022, 12:39 PM

jdoerfert requested review of this revision.Feb 16 2022, 12:39 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 16 2022, 12:39 PM

I'm not convinced this is right when there are ABI affecting attributes involved. Consider the first example, where zeroext is dropped. Let's hypothetically assume that this indicates a zeroext up to 64-bit, but we have 32-bit pointers. Doing the transform and dropping the zeroext means that the top bits will be left uninitialized now.

For non-ABI attributes, I agree that just dropping them is fine.

In D119967#3327278, @nikic wrote:

I'm not convinced this is right when there are ABI affecting attributes involved. Consider the first example, where zeroext is dropped. Let's hypothetically assume that this indicates a zeroext up to 64-bit, but we have 32-bit pointers. Doing the transform and dropping the zeroext means that the top bits will be left uninitialized now.

I don't think zeroext is a problem. The 64 bit case shows why, isBitOrNoopPointerCastable is false and we won't transform it.

For non-ABI attributes, I agree that just dropping them is fine.

Harbormaster completed remote builds in B150048: Diff 409371.Feb 16 2022, 1:40 PM

aeubanks added inline comments.Feb 21 2022, 4:40 PM

llvm/test/Transforms/InstCombine/call-cast-attrs.ll
36	is losing the nocapture worth changing this to a direct call?

jdoerfert added inline comments.Feb 21 2022, 7:41 PM

llvm/test/Transforms/InstCombine/call-cast-attrs.ll
36	Yes, because we cannot analyze or inline indirect calls. The nocapture here would not be derived by any pass.

In D119967#3327289, @jdoerfert wrote:

In D119967#3327278, @nikic wrote:

I'm not convinced this is right when there are ABI affecting attributes involved. Consider the first example, where zeroext is dropped. Let's hypothetically assume that this indicates a zeroext up to 64-bit, but we have 32-bit pointers. Doing the transform and dropping the zeroext means that the top bits will be left uninitialized now.

I don't think zeroext is a problem. The 64 bit case shows why, isBitOrNoopPointerCastable is false and we won't transform it.

I'm referring to the case where the argument is pointer-sized (so it is castable), but the (target-specific) semantics of zeroext require an extension beyond pointer size.

In D119967#3336985, @nikic wrote:

In D119967#3327289, @jdoerfert wrote:

In D119967#3327278, @nikic wrote:

I'm not convinced this is right when there are ABI affecting attributes involved. Consider the first example, where zeroext is dropped. Let's hypothetically assume that this indicates a zeroext up to 64-bit, but we have 32-bit pointers. Doing the transform and dropping the zeroext means that the top bits will be left uninitialized now.

I don't think zeroext is a problem. The 64 bit case shows why, isBitOrNoopPointerCastable is false and we won't transform it.

I'm referring to the case where the argument is pointer-sized (so it is castable), but the (target-specific) semantics of zeroext require an extension beyond pointer size.

So you are saying it is legal and expected that we extend the argument even though the callee expects a pointer (of smaller size)? (How is that not UB?)

friendly ping :)

Herald added a project: Restricted Project. · View Herald TranscriptMar 18 2022, 4:15 PM

In general, the reason we don't just treat calling a bitcast as instant UB is that we want to avoid breaking cases where different signatures represent equivalent calls, but the IR types aren't compatible. We don't have any standard rule for lowering C calls to IR, so different language frontends might do slightly different things.

Along those lines, some attributes hide significant differences in how the call is actually lowered; for example, byval means that a "pointer" isn't really a pointer at all. Some attributes are obviously irrelevant, like noundef.

Going through the LangRef list, zeroext/signext/inreg/sret/nest/swiftself/swiftasync/swifterror are the parameter attributes which this code currently doesn't explicitly check for, but are ABI significant. But... given the motivation, they're unlikely to be relevant, I guess? If there's a mismatch, and the call otherwise passes the checks, the untransformed call is probably UB anyway.

That said, I don't really want to try to write up a coherent justification for each of those, or worry about attributes that will be implemented in the future. An explicit list of attributes which are obviously safe to drop would be much simpler to reason about.

Introduce the concept of safe and unsafe attributes wrt. dropping them

Herald added a subscriber: dexonsmith. · View Herald TranscriptMar 27 2022, 10:03 PM

In D119967#3410527, @jdoerfert wrote:

Introduce the concept of safe and unsafe attributes wrt. dropping them

Nice!

LGTM

The classification into safe/unsafe makes sense to me. There's probably some further cleanup we could do here, but this seems like a reasonable intermediate step.

This revision is now accepted and ready to land.Mar 28 2022, 4:08 PM

Harbormaster completed remote builds in B156487: Diff 418491.Mar 28 2022, 5:46 PM

Closed by commit rGbb0b23174e4a: [InstCombineCalls] Optimize call of bitcast even w/ parameter attributes (authored by jdoerfert). · Explain WhyMar 28 2022, 6:58 PM

This revision was automatically updated to reflect the committed changes.

jdoerfert added a commit: rGbb0b23174e4a: [InstCombineCalls] Optimize call of bitcast even w/ parameter attributes.

Revision Contents

Path

Size

llvm/

include/

llvm/

IR/

Attributes.h

13 lines

lib/

IR/

Attributes.cpp

62 lines

Transforms/

InstCombine/

InstCombineCalls.cpp

25 lines

test/

Transforms/

InstCombine/

call-cast-attrs.ll

17 lines

Diff 418755

llvm/include/llvm/IR/Attributes.h

Show First 20 Lines • Show All 1,205 Lines • ▼ Show 20 Lines	public:
ArrayRef<Attribute> attrs() const { return Attrs; }		ArrayRef<Attribute> attrs() const { return Attrs; }

bool operator==(const AttrBuilder &B) const;		bool operator==(const AttrBuilder &B) const;
bool operator!=(const AttrBuilder &B) const { return !(*this == B); }		bool operator!=(const AttrBuilder &B) const { return !(*this == B); }
};		};

namespace AttributeFuncs {		namespace AttributeFuncs {

/// Which attributes cannot be applied to a type.		enum AttributeSafetyKind : uint8_t {
AttributeMask typeIncompatible(Type *Ty);		ASK_SAFE_TO_DROP = 1,
		ASK_UNSAFE_TO_DROP = 2,
		ASK_ALL = ASK_SAFE_TO_DROP \| ASK_UNSAFE_TO_DROP,
		};

		/// Which attributes cannot be applied to a type. The argument \p ASK indicates,
		/// if only attributes that are known to be safely droppable are contained in
		/// the mask; only attributes that might be unsafe to drop (e.g., ABI-related
		/// attributes) are in the mask; or both.
		AttributeMask typeIncompatible(Type *Ty, AttributeSafetyKind ASK = ASK_ALL);

/// Get param/return attributes which imply immediate undefined behavior if an		/// Get param/return attributes which imply immediate undefined behavior if an
/// invalid value is passed. For example, this includes noundef (where undef		/// invalid value is passed. For example, this includes noundef (where undef
/// implies UB), but not nonnull (where null implies poison). It also does not		/// implies UB), but not nonnull (where null implies poison). It also does not
/// include attributes like nocapture, which constrain the function		/// include attributes like nocapture, which constrain the function
/// implementation rather than the passed value.		/// implementation rather than the passed value.
AttributeMask getUBImplyingAttributes();		AttributeMask getUBImplyingAttributes();

Show All 27 Lines

llvm/lib/IR/Attributes.cpp

Show First 20 Lines • Show All 1,767 Lines • ▼ Show 20 Lines	bool AttrBuilder::operator==(const AttrBuilder &B) const {
return Attrs == B.Attrs;		return Attrs == B.Attrs;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AttributeFuncs Function Defintions		// AttributeFuncs Function Defintions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// Which attributes cannot be applied to a type.		/// Which attributes cannot be applied to a type.
AttributeMask AttributeFuncs::typeIncompatible(Type *Ty) {		AttributeMask AttributeFuncs::typeIncompatible(Type *Ty,
		AttributeSafetyKind ASK) {
AttributeMask Incompatible;		AttributeMask Incompatible;

if (!Ty->isIntegerTy())		if (!Ty->isIntegerTy()) {
// Attributes that only apply to integers.		// Attributes that only apply to integers.
Incompatible.addAttribute(Attribute::SExt)		if (ASK & ASK_SAFE_TO_DROP)
.addAttribute(Attribute::ZExt)		Incompatible.addAttribute(Attribute::AllocAlign);
.addAttribute(Attribute::AllocAlign);		if (ASK & ASK_UNSAFE_TO_DROP)
		Incompatible.addAttribute(Attribute::SExt).addAttribute(Attribute::ZExt);
		}

if (!Ty->isPointerTy())		if (!Ty->isPointerTy()) {
// Attributes that only apply to pointers.		// Attributes that only apply to pointers.
Incompatible.addAttribute(Attribute::Nest)		if (ASK & ASK_SAFE_TO_DROP)
.addAttribute(Attribute::NoAlias)		Incompatible.addAttribute(Attribute::NoAlias)
.addAttribute(Attribute::NoCapture)		.addAttribute(Attribute::NoCapture)
.addAttribute(Attribute::NonNull)		.addAttribute(Attribute::NonNull)
.addAttribute(Attribute::ReadNone)		.addAttribute(Attribute::ReadNone)
.addAttribute(Attribute::ReadOnly)		.addAttribute(Attribute::ReadOnly)
.addAttribute(Attribute::SwiftError)
.addAttribute(Attribute::Dereferenceable)		.addAttribute(Attribute::Dereferenceable)
.addAttribute(Attribute::DereferenceableOrNull)		.addAttribute(Attribute::DereferenceableOrNull);
		if (ASK & ASK_UNSAFE_TO_DROP)
		Incompatible.addAttribute(Attribute::Nest)
		.addAttribute(Attribute::SwiftError)
.addAttribute(Attribute::Preallocated)		.addAttribute(Attribute::Preallocated)
.addAttribute(Attribute::InAlloca)		.addAttribute(Attribute::InAlloca)
.addAttribute(Attribute::ByVal)		.addAttribute(Attribute::ByVal)
.addAttribute(Attribute::StructRet)		.addAttribute(Attribute::StructRet)
.addAttribute(Attribute::ByRef)		.addAttribute(Attribute::ByRef)
.addAttribute(Attribute::ElementType);		.addAttribute(Attribute::ElementType);
		}

if (!Ty->isPtrOrPtrVectorTy())
// Attributes that only apply to pointers or vectors of pointers.		// Attributes that only apply to pointers or vectors of pointers.
		if (!Ty->isPtrOrPtrVectorTy()) {
		if (ASK & ASK_SAFE_TO_DROP)
Incompatible.addAttribute(Attribute::Alignment);		Incompatible.addAttribute(Attribute::Alignment);
		}

// Some attributes can apply to all "values" but there are no `void` values.		// Some attributes can apply to all "values" but there are no `void` values.
if (Ty->isVoidTy())		if (Ty->isVoidTy()) {
		if (ASK & ASK_SAFE_TO_DROP)
Incompatible.addAttribute(Attribute::NoUndef);		Incompatible.addAttribute(Attribute::NoUndef);
		}

return Incompatible;		return Incompatible;
}		}

AttributeMask AttributeFuncs::getUBImplyingAttributes() {		AttributeMask AttributeFuncs::getUBImplyingAttributes() {
AttributeMask AM;		AttributeMask AM;
AM.addAttribute(Attribute::NoUndef);		AM.addAttribute(Attribute::NoUndef);
AM.addAttribute(Attribute::Dereferenceable);		AM.addAttribute(Attribute::Dereferenceable);
▲ Show 20 Lines • Show All 203 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp

Show First 20 Lines • Show All 3,192 Lines • ▼ Show 20 Lines	bool InstCombinerImpl::transformConstExprCastCall(CallBase &Call) {
auto AI = Call.arg_begin();		auto AI = Call.arg_begin();
for (unsigned i = 0, e = NumCommonArgs; i != e; ++i, ++AI) {		for (unsigned i = 0, e = NumCommonArgs; i != e; ++i, ++AI) {
Type *ParamTy = FT->getParamType(i);		Type *ParamTy = FT->getParamType(i);
Type ActTy = (AI)->getType();		Type ActTy = (AI)->getType();

if (!CastInst::isBitOrNoopPointerCastable(ActTy, ParamTy, DL))		if (!CastInst::isBitOrNoopPointerCastable(ActTy, ParamTy, DL))
return false; // Cannot transform this parameter value.		return false; // Cannot transform this parameter value.

		// Check if there are any incompatible attributes we cannot drop safely.
if (AttrBuilder(FT->getContext(), CallerPAL.getParamAttrs(i))		if (AttrBuilder(FT->getContext(), CallerPAL.getParamAttrs(i))
.overlaps(AttributeFuncs::typeIncompatible(ParamTy)))		.overlaps(AttributeFuncs::typeIncompatible(
		ParamTy, AttributeFuncs::ASK_UNSAFE_TO_DROP)))
return false; // Attribute not compatible with transformed value.		return false; // Attribute not compatible with transformed value.

if (Call.isInAllocaArgument(i))		if (Call.isInAllocaArgument(i) \|\|
return false; // Cannot transform to and from inalloca.		CallerPAL.hasParamAttr(i, Attribute::Preallocated))
		return false; // Cannot transform to and from inalloca/preallocated.

if (CallerPAL.hasParamAttr(i, Attribute::SwiftError))		if (CallerPAL.hasParamAttr(i, Attribute::SwiftError))
return false;		return false;

// If the parameter is passed as a byval argument, then we have to have a		// If the parameter is passed as a byval argument, then we have to have a
// sized type and the sized type has to have the same size as the old type.		// sized type and the sized type has to have the same size as the old type.
if (ParamTy != ActTy && CallerPAL.hasParamAttr(i, Attribute::ByVal)) {		if (ParamTy != ActTy && CallerPAL.hasParamAttr(i, Attribute::ByVal)) {
PointerType *ParamPTy = dyn_cast<PointerType>(ParamTy);		PointerType *ParamPTy = dyn_cast<PointerType>(ParamTy);
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	bool InstCombinerImpl::transformConstExprCastCall(CallBase &Call) {
for (unsigned i = 0; i != NumCommonArgs; ++i, ++AI) {		for (unsigned i = 0; i != NumCommonArgs; ++i, ++AI) {
Type *ParamTy = FT->getParamType(i);		Type *ParamTy = FT->getParamType(i);

Value NewArg = AI;		Value NewArg = AI;
if ((*AI)->getType() != ParamTy)		if ((*AI)->getType() != ParamTy)
NewArg = Builder.CreateBitOrPointerCast(*AI, ParamTy);		NewArg = Builder.CreateBitOrPointerCast(*AI, ParamTy);
Args.push_back(NewArg);		Args.push_back(NewArg);

// Add any parameter attributes.		// Add any parameter attributes except the ones incompatible with the new
		// type. Note that we made sure all incompatible ones are safe to drop.
		AttributeMask IncompatibleAttrs = AttributeFuncs::typeIncompatible(
		ParamTy, AttributeFuncs::ASK_SAFE_TO_DROP);
if (CallerPAL.hasParamAttr(i, Attribute::ByVal) &&		if (CallerPAL.hasParamAttr(i, Attribute::ByVal) &&
!ParamTy->isOpaquePointerTy()) {		!ParamTy->isOpaquePointerTy()) {
AttrBuilder AB(FT->getContext(), CallerPAL.getParamAttrs(i));		AttrBuilder AB(Ctx, CallerPAL.getParamAttrs(i).removeAttributes(
AB.addByValAttr(ParamTy->getNonOpaquePointerElementType());		Ctx, IncompatibleAttrs));
		AB.addByValAttr(NewArg->getType()->getPointerElementType());
ArgAttrs.push_back(AttributeSet::get(Ctx, AB));		ArgAttrs.push_back(AttributeSet::get(Ctx, AB));
} else		} else {
ArgAttrs.push_back(CallerPAL.getParamAttrs(i));		ArgAttrs.push_back(
		CallerPAL.getParamAttrs(i).removeAttributes(Ctx, IncompatibleAttrs));
		}
}		}

// If the function takes more arguments than the call was taking, add them		// If the function takes more arguments than the call was taking, add them
// now.		// now.
for (unsigned i = NumCommonArgs; i != FT->getNumParams(); ++i) {		for (unsigned i = NumCommonArgs; i != FT->getNumParams(); ++i) {
Args.push_back(Constant::getNullValue(FT->getParamType(i)));		Args.push_back(Constant::getNullValue(FT->getParamType(i)));
ArgAttrs.push_back(AttributeSet());		ArgAttrs.push_back(AttributeSet());
}		}
▲ Show 20 Lines • Show All 246 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/call-cast-attrs.ll

	; RUN: opt < %s -passes=instcombine -S \| FileCheck %s			; RUN: opt < %s -passes=instcombine -data-layout="p:32:32" -S \| FileCheck %s --check-prefixes=CHECK,CHECK32
				; RUN: opt < %s -passes=instcombine -data-layout="p:64:64" -S \| FileCheck %s --check-prefixes=CHECK,CHECK64

	define signext i32 @b(i32* inreg %x) {			define signext i32 @b(i32* inreg %x) {
	ret i32 0			ret i32 0
	}			}

	define void @c(...) {			define void @c(...) {
	ret void			ret void
	}			}

	declare void @useit(i32)			declare void @useit(i32)

	define void @d(i32 %x, ...) {			define void @d(i32 %x, ...) {
	call void @useit(i32 %x)			call void @useit(i32 %x)
	ret void			ret void
	}			}

	define void @g(i32* %y) {			define void @g(i32* %y) {
	call i32 bitcast (i32 (i32) @b to i32 (i32)*)(i32 zeroext 0)			call i32 bitcast (i32 (i32) @b to i32 (i32)*)(i32 zeroext 0)
	call void bitcast (void (...)* @c to void (i32))(i32* %y)			call void bitcast (void (...)* @c to void (i32))(i32* %y)
	call void bitcast (void (...)* @c to void (i32))(i32* sret(i32) %y)			call void bitcast (void (...)* @c to void (i32))(i32* sret(i32) %y)
	call void bitcast (void (i32, ...)* @d to void (i32, i32))(i32 0, i32* sret(i32) %y)			call void bitcast (void (i32, ...)* @d to void (i32, i32))(i32 0, i32* sret(i32) %y)
				call void bitcast (void (i32, ...)* @d to void (i32, i32))(i32 0, i32* nocapture %y)
				call void bitcast (void (i32, ...)* @d to void (i32))(i32* nocapture noundef %y)
	ret void			ret void
	}			}
	; CHECK-LABEL: define void @g(i32* %y)			; CHECK-LABEL: define void @g(i32* %y)
	; CHECK: call i32 bitcast (i32 (i32) @b to i32 (i32)*)(i32 zeroext 0)			; CHECK: call i32 bitcast (i32 (i32) @b to i32 (i32)*)(i32 zeroext 0)
	; CHECK: call void (...) @c(i32* %y)			; CHECK: call void (...) @c(i32* %y)
	; CHECK: call void bitcast (void (...)* @c to void (i32))(i32* sret(i32) %y)			; CHECK: call void bitcast (void (...)* @c to void (i32))(i32* sret(i32) %y)
	; CHECK: call void bitcast (void (i32, ...)* @d to void (i32, i32))(i32 0, i32* sret(i32) %y)			; CHECK: call void bitcast (void (i32, ...)* @d to void (i32, i32))(i32 0, i32* sret(i32) %y)
				; CHECK: call void (i32, ...) @d(i32 0, i32* nocapture %y)
				; CHECK32: %2 = ptrtoint i32* %y to i32
				; CHECK32: call void (i32, ...) @d(i32 noundef %2)
				; CHECK64: call void bitcast (void (i32, ...)* @d to void (i32))(i32* nocapture noundef %y)
				aeubanksUnsubmitted Not Done Reply Inline Actions is losing the nocapture worth changing this to a direct call? aeubanks: is losing the nocapture worth changing this to a direct call?
				jdoerfertAuthorUnsubmitted Done Reply Inline Actions Yes, because we cannot analyze or inline indirect calls. The nocapture here would not be derived by any pass. jdoerfert: Yes, because we cannot analyze or inline indirect calls. The nocapture here would not be…