Diff 411458

llvm/docs/AMDGPUUsage.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 915 Lines • ▼ Show 20 Lines	.. table:: AMDGPU LLVM IR Attributes
"amdgpu-no-queue-ptr" Similar to amdgpu-no-workitem-id-x, except for the		"amdgpu-no-queue-ptr" Similar to amdgpu-no-workitem-id-x, except for the
llvm.amdgcn.queue.ptr intrinsic. Note that unlike the other ABI hint		llvm.amdgcn.queue.ptr intrinsic. Note that unlike the other ABI hint
attributes, the queue pointer may be required in situations where the		attributes, the queue pointer may be required in situations where the
intrinsic call does not directly appear in the program. Some subtargets		intrinsic call does not directly appear in the program. Some subtargets
require the queue pointer for to handle some addrspacecasts, as well		require the queue pointer for to handle some addrspacecasts, as well
as the llvm.amdgcn.is.shared, llvm.amdgcn.is.private, llvm.trap, and		as the llvm.amdgcn.is.shared, llvm.amdgcn.is.private, llvm.trap, and
llvm.debug intrinsics.		llvm.debug intrinsics.

		"amdgpu-no-hostcall-ptr" Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit
		kernel argument that holds the pointer to the hostcall buffer. If this
		attribute is absent, then the amdgpu-no-implicitarg-ptr is also removed.

		"amdgpu-no-heap-ptr" Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit
		kernel argument that holds the pointer to an initialized memory buffer
		that conforms to the requirements of the malloc/free device library V1
		version implementation. If this attribute is absent, then the
		amdgpu-no-implicitarg-ptr is also removed.

======================================= ==========================================================		======================================= ==========================================================

.. _amdgpu-elf-code-object:		.. _amdgpu-elf-code-object:

ELF Code Object		ELF Code Object
===============		===============

The AMDGPU backend generates a standard ELF [ELF]_ relocatable code object that		The AMDGPU backend generates a standard ELF [ELF]_ relocatable code object that
▲ Show 20 Lines • Show All 2,638 Lines • ▼ Show 20 Lines	".value_kind" string Required Kernel argument kind that
of the Z dimension, if it exists. Must be zero if a partial		of the Z dimension, if it exists. Must be zero if a partial
work group does not exist in the Z dimension.		work group does not exist in the Z dimension.

"hidden_grid_dims"		"hidden_grid_dims"
The grid dispatch dimensionality. This is the same value		The grid dispatch dimensionality. This is the same value
as the AQL dispatch packet dimensionality. Must be a value		as the AQL dispatch packet dimensionality. Must be a value
between 1 and 3.		between 1 and 3.

		"hidden_heap_v1"
		A global address space pointer to an initialized memory
		buffer that conforms to the requirements of the malloc/free
		device library V1 version implementation.

"hidden_private_base"		"hidden_private_base"
The high 32 bits of the flat addressing private aperture base.		The high 32 bits of the flat addressing private aperture base.
Only used by GFX8 to allow conversion between private segment		Only used by GFX8 to allow conversion between private segment
and flat addresses. See :ref:`amdgpu-amdhsa-kernel-prolog-flat-scratch`.		and flat addresses. See :ref:`amdgpu-amdhsa-kernel-prolog-flat-scratch`.

"hidden_shared_base"		"hidden_shared_base"
The high 32 bits of the flat addressing shared aperture base.		The high 32 bits of the flat addressing shared aperture base.
Only used by GFX8 to allow conversion between shared segment		Only used by GFX8 to allow conversion between shared segment
▲ Show 20 Lines • Show All 8,955 Lines • Show Last 20 Lines

llvm/lib/BinaryFormat/AMDGPUMetadataVerifier.cpp

Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	if (!verifyScalarEntry(ArgsMap, ".name", false,
return false;		return false;
if (!verifyScalarEntry(ArgsMap, ".type_name", false,		if (!verifyScalarEntry(ArgsMap, ".type_name", false,
msgpack::Type::String))		msgpack::Type::String))
return false;		return false;
if (!verifyIntegerEntry(ArgsMap, ".size", true))		if (!verifyIntegerEntry(ArgsMap, ".size", true))
return false;		return false;
if (!verifyIntegerEntry(ArgsMap, ".offset", true))		if (!verifyIntegerEntry(ArgsMap, ".offset", true))
return false;		return false;
if (!verifyScalarEntry(ArgsMap, ".value_kind", true,		if (!verifyScalarEntry(ArgsMap, ".value_kind", true, msgpack::Type::String,
		sameerdsUnsubmitted Not Done Reply Inline Actions Seems to be a whitespace change introduced by clang-format? sameerds: Seems to be a whitespace change introduced by clang-format?
		cfangAuthorUnsubmitted Done Reply Inline Actions Right, a clang-format change. cfang: Right, a clang-format change.
msgpack::Type::String,
[](msgpack::DocNode &SNode) {		[](msgpack::DocNode &SNode) {
return StringSwitch<bool>(SNode.getString())		return StringSwitch<bool>(SNode.getString())
.Case("by_value", true)		.Case("by_value", true)
.Case("global_buffer", true)		.Case("global_buffer", true)
.Case("dynamic_shared_pointer", true)		.Case("dynamic_shared_pointer", true)
.Case("sampler", true)		.Case("sampler", true)
.Case("image", true)		.Case("image", true)
.Case("pipe", true)		.Case("pipe", true)
Show All 9 Lines	if (!verifyScalarEntry(ArgsMap, ".value_kind", true, msgpack::Type::String,
.Case("hidden_remainder_z", true)		.Case("hidden_remainder_z", true)
.Case("hidden_global_offset_x", true)		.Case("hidden_global_offset_x", true)
.Case("hidden_global_offset_y", true)		.Case("hidden_global_offset_y", true)
.Case("hidden_global_offset_z", true)		.Case("hidden_global_offset_z", true)
.Case("hidden_grid_dims", true)		.Case("hidden_grid_dims", true)
.Case("hidden_none", true)		.Case("hidden_none", true)
.Case("hidden_printf_buffer", true)		.Case("hidden_printf_buffer", true)
.Case("hidden_hostcall_buffer", true)		.Case("hidden_hostcall_buffer", true)
		.Case("hidden_heap_v1", true)
.Case("hidden_default_queue", true)		.Case("hidden_default_queue", true)
.Case("hidden_completion_action", true)		.Case("hidden_completion_action", true)
.Case("hidden_multigrid_sync_arg", true)		.Case("hidden_multigrid_sync_arg", true)
.Case("hidden_private_base", true)		.Case("hidden_private_base", true)
.Case("hidden_shared_base", true)		.Case("hidden_shared_base", true)
.Case("hidden_queue_ptr", true)		.Case("hidden_queue_ptr", true)
.Default(false);		.Default(false);
}))		}))
▲ Show 20 Lines • Show All 171 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AMDGPUAttributes.def

	Show All 13 Lines

	// NOTE: NO INCLUDE GUARD DESIRED!			// NOTE: NO INCLUDE GUARD DESIRED!

	AMDGPU_ATTRIBUTE(DISPATCH_PTR, "amdgpu-no-dispatch-ptr")			AMDGPU_ATTRIBUTE(DISPATCH_PTR, "amdgpu-no-dispatch-ptr")
	AMDGPU_ATTRIBUTE(QUEUE_PTR, "amdgpu-no-queue-ptr")			AMDGPU_ATTRIBUTE(QUEUE_PTR, "amdgpu-no-queue-ptr")
	AMDGPU_ATTRIBUTE(DISPATCH_ID, "amdgpu-no-dispatch-id")			AMDGPU_ATTRIBUTE(DISPATCH_ID, "amdgpu-no-dispatch-id")
	AMDGPU_ATTRIBUTE(IMPLICIT_ARG_PTR, "amdgpu-no-implicitarg-ptr")			AMDGPU_ATTRIBUTE(IMPLICIT_ARG_PTR, "amdgpu-no-implicitarg-ptr")
	AMDGPU_ATTRIBUTE(HOSTCALL_PTR, "amdgpu-no-hostcall-ptr")			AMDGPU_ATTRIBUTE(HOSTCALL_PTR, "amdgpu-no-hostcall-ptr")
				AMDGPU_ATTRIBUTE(HEAP_PTR, "amdgpu-no-heap-ptr")
				sameerdsUnsubmitted Done Reply Inline Actions It seems we need to document these attributes in AMDGPUUsage, which I missed for no-hostcall-ptr. https://llvm.org/docs/AMDGPUUsage.html#llvm-ir-attributes sameerds: It seems we need to document these attributes in AMDGPUUsage, which I missed for no-hostcall…
				cfangAuthorUnsubmitted Done Reply Inline Actions Can you suggest the description for both no-hostcall and no-heap-ptr? Thanks. cfang: Can you suggest the description for both no-hostcall and no-heap-ptr? Thanks.
				sameerdsUnsubmitted Done Reply Inline Actions How about this: "Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit kernel argument that holds the pointer to the hostcall buffer. If this attribute is absent, then the amdgpu-no-implicitarg-ptr is also removed." I do believe the removal already happens, but it will be good to convince ourselves about that. sameerds: How about this: "Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit kernel…
				cfangAuthorUnsubmitted Done Reply Inline Actions Agreed! Thanks for the suggestions. cfang: Agreed! Thanks for the suggestions.
	AMDGPU_ATTRIBUTE(WORKGROUP_ID_X, "amdgpu-no-workgroup-id-x")			AMDGPU_ATTRIBUTE(WORKGROUP_ID_X, "amdgpu-no-workgroup-id-x")
	AMDGPU_ATTRIBUTE(WORKGROUP_ID_Y, "amdgpu-no-workgroup-id-y")			AMDGPU_ATTRIBUTE(WORKGROUP_ID_Y, "amdgpu-no-workgroup-id-y")
	AMDGPU_ATTRIBUTE(WORKGROUP_ID_Z, "amdgpu-no-workgroup-id-z")			AMDGPU_ATTRIBUTE(WORKGROUP_ID_Z, "amdgpu-no-workgroup-id-z")
	AMDGPU_ATTRIBUTE(WORKITEM_ID_X, "amdgpu-no-workitem-id-x")			AMDGPU_ATTRIBUTE(WORKITEM_ID_X, "amdgpu-no-workitem-id-x")
	AMDGPU_ATTRIBUTE(WORKITEM_ID_Y, "amdgpu-no-workitem-id-y")			AMDGPU_ATTRIBUTE(WORKITEM_ID_Y, "amdgpu-no-workitem-id-y")
	AMDGPU_ATTRIBUTE(WORKITEM_ID_Z, "amdgpu-no-workitem-id-z")			AMDGPU_ATTRIBUTE(WORKITEM_ID_Z, "amdgpu-no-workitem-id-z")

	#undef AMDGPU_ATTRIBUTE			#undef AMDGPU_ATTRIBUTE

llvm/lib/Target/AMDGPU/AMDGPUAttributor.cpp

Show First 20 Lines • Show All 404 Lines • ▼ Show 20 Lines	if (!NeedsQueuePtr) {
NeedsQueuePtr = checkForQueuePtr(A);		NeedsQueuePtr = checkForQueuePtr(A);
}		}

if (NeedsQueuePtr) {		if (NeedsQueuePtr) {
removeAssumedBits(QUEUE_PTR);		removeAssumedBits(QUEUE_PTR);
}		}

if (funcRetrievesHostcallPtr(A)) {		if (funcRetrievesHostcallPtr(A)) {
removeAssumedBits(IMPLICIT_ARG_PTR);		assert(!isAssumed(IMPLICIT_ARG_PTR) && "hostcall needs implicitarg_ptr");
sameerdsUnsubmitted Not Done Reply Inline Actions Why was this removed? Is there some other place which guarantees the same relationship? sameerds: Why was this removed? Is there some other place which guarantees the same relationship?
cfangAuthorUnsubmitted Done Reply Inline Actions I think the explicit call of amdgcn_implicitarg_ptr will guarantee IMPLICIT_ARG_PTR. cfang: I think the explicit call of amdgcn_implicitarg_ptr will guarantee IMPLICIT_ARG_PTR.
sameerdsUnsubmitted Not Done Reply Inline Actions That sounds correct. But then this should be asserted somewhere ... probably in the code that manifests the attribute. sameerds: That sounds correct. But then this should be asserted somewhere ... probably in the code that…
cfangAuthorUnsubmitted Done Reply Inline Actions Do you think the for loop a few lines ahead in the same function should and "MUST" catch amdgcn_implicitarg_ptr? So that we don't need an assert? 386: for (Function Callee : AAEdges.getOptimisticEdges()) { cfang:* Do you think the for loop a few lines ahead in the same function should and "MUST" catch…
sameerdsUnsubmitted Not Done Reply Inline Actions Maybe it does. But that's not the point. This relationship between the two attributes is pretty important and we should assert. Even the coding standard says so [1] and it is pretty much a life-saver. Maybe I should have asserted myself, but that last change was kinda in a bit of a hurry! [1] https://llvm.org/docs/CodingStandards.html#assert-liberally sameerds: Maybe it does. But that's not the point. This relationship between the two attributes is pretty…
cfangAuthorUnsubmitted Done Reply Inline Actions Could not cfang: Could not
removeAssumedBits(HOSTCALL_PTR);		removeAssumedBits(HOSTCALL_PTR);
}		}

		if (funcRetrievesHeapPtr(A)) {
		assert(!isAssumed(IMPLICIT_ARG_PTR) && "heap_ptr needs implicitarg_ptr");
		removeAssumedBits(HEAP_PTR);
		}

return getAssumed() != OrigAssumed ? ChangeStatus::CHANGED		return getAssumed() != OrigAssumed ? ChangeStatus::CHANGED
: ChangeStatus::UNCHANGED;		: ChangeStatus::UNCHANGED;
}		}

ChangeStatus manifest(Attributor &A) override {		ChangeStatus manifest(Attributor &A) override {
SmallVector<Attribute, 8> AttrList;		SmallVector<Attribute, 8> AttrList;
LLVMContext &Ctx = getAssociatedFunction()->getContext();		LLVMContext &Ctx = getAssociatedFunction()->getContext();

▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	for (BasicBlock &BB : *F) {
}		}
}		}

return false;		return false;
}		}

bool funcRetrievesHostcallPtr(Attributor &A) {		bool funcRetrievesHostcallPtr(Attributor &A) {
auto Pos = llvm::AMDGPU::getHostcallImplicitArgPosition();		auto Pos = llvm::AMDGPU::getHostcallImplicitArgPosition();
		AAPointerInfo::OffsetAndSize OAS(Pos, 8);
		return funcRetrievesImplicitKernelArg(A, OAS);
		}

		bool funcRetrievesHeapPtr(Attributor &A) {
		if (AMDGPU::getAmdhsaCodeObjectVersion() != 5)
		return false;
		auto Pos = llvm::AMDGPU::getHeapPtrImplicitArgPosition();
		AAPointerInfo::OffsetAndSize OAS(Pos, 8);
		return funcRetrievesImplicitKernelArg(A, OAS);
		}

		bool funcRetrievesImplicitKernelArg(Attributor &A,
		AAPointerInfo::OffsetAndSize OAS) {
// Check if this is a call to the implicitarg_ptr builtin and it		// Check if this is a call to the implicitarg_ptr builtin and it
// is used to retrieve the hostcall pointer. The implicit arg for		// is used to retrieve the hostcall pointer. The implicit arg for
		b-sumnerUnsubmitted Done Reply Inline Actions We could potentially be more specific with KernargLoc instead of MemoryLoc. b-sumner: We could potentially be more specific with KernargLoc instead of MemoryLoc.
		sameerdsUnsubmitted Done Reply Inline Actions Or more importantly, the outer function needs a more specific name ... it is checking whether the supplied offset is being accessed from the implicitarg_ptr base. sameerds: Or more importantly, the outer function needs a more specific name ... it is checking whether…
		cfangAuthorUnsubmitted Done Reply Inline Actions Can we use "funcRetrievesImplicitKernarg(...) ? cfang: Can we use "funcRetrievesImplicitKernarg(...) ?
		sameerdsUnsubmitted Done Reply Inline Actions Sounds good to me. But "kernarg" usually refers to the segment ... maybe say "KernelArgument" instead? I am okay with either, since the function has a very limited scope. sameerds: Sounds good to me. But "kernarg" usually refers to the segment ... maybe say "KernelArgument"…
		cfangAuthorUnsubmitted Done Reply Inline Actions Will update as suggested. Thanks. cfang: Will update as suggested. Thanks.
// hostcall is not used only if every use of the implicitarg_ptr		// hostcall is not used only if every use of the implicitarg_ptr
// is a load that clearly does not retrieve any byte of the		// is a load that clearly does not retrieve any byte of the
// hostcall pointer. We check this by tracing all the uses of the		// hostcall pointer. We check this by tracing all the uses of the
// initial call to the implicitarg_ptr intrinsic.		// initial call to the implicitarg_ptr intrinsic.
auto DoesNotLeadToHostcallPtr = [&](Instruction &I) {		auto DoesNotLeadToKernelArgLoc = [&](Instruction &I) {
auto &Call = cast<CallBase>(I);		auto &Call = cast<CallBase>(I);
if (Call.getIntrinsicID() != Intrinsic::amdgcn_implicitarg_ptr)		if (Call.getIntrinsicID() != Intrinsic::amdgcn_implicitarg_ptr)
return true;		return true;

const auto &PointerInfoAA = A.getAAFor<AAPointerInfo>(		const auto &PointerInfoAA = A.getAAFor<AAPointerInfo>(
*this, IRPosition::callsite_returned(Call), DepClassTy::REQUIRED);		*this, IRPosition::callsite_returned(Call), DepClassTy::REQUIRED);

AAPointerInfo::OffsetAndSize OAS(Pos, 8);
return PointerInfoAA.forallInterferingAccesses(		return PointerInfoAA.forallInterferingAccesses(
OAS, [](const AAPointerInfo::Access &Acc, bool IsExact) {		OAS, [](const AAPointerInfo::Access &Acc, bool IsExact) {
return Acc.getRemoteInst()->isDroppable();		return Acc.getRemoteInst()->isDroppable();
});		});
};		};

bool UsedAssumedInformation = false;		bool UsedAssumedInformation = false;
return !A.checkForAllCallLikeInstructions(DoesNotLeadToHostcallPtr, *this,		return !A.checkForAllCallLikeInstructions(DoesNotLeadToKernelArgLoc, *this,
UsedAssumedInformation);		UsedAssumedInformation);
}		}
};		};

AAAMDAttributes &AAAMDAttributes::createForPosition(const IRPosition &IRP,		AAAMDAttributes &AAAMDAttributes::createForPosition(const IRPosition &IRP,
Attributor &A) {		Attributor &A) {
if (IRP.getPositionKind() == IRPosition::IRP_FUNCTION)		if (IRP.getPositionKind() == IRPosition::IRP_FUNCTION)
return *new (A.Allocator) AAAMDAttributesFunction(IRP, A);		return *new (A.Allocator) AAAMDAttributesFunction(IRP, A);
▲ Show 20 Lines • Show All 174 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.cpp

Show First 20 Lines • Show All 1,022 Lines • ▼ Show 20 Lines	if (MFI.hasHostcallPtr()) {
emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_hostcall_buffer", Offset,		emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_hostcall_buffer", Offset,
Args);		Args);
} else		} else
Offset += 8; // Skipped.		Offset += 8; // Skipped.

emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_multigrid_sync_arg", Offset,		emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_multigrid_sync_arg", Offset,
Args);		Args);

// Ignore temporarily until it is implemented.		if (MFI.hasHeapPtr())
// emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_heap_v1", Offset, Args);		emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_heap_v1", Offset, Args);
Offset += 8;		else
		Offset += 8; // Skipped.

if (Func.hasFnAttribute("calls-enqueue-kernel")) {		if (Func.hasFnAttribute("calls-enqueue-kernel")) {
emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_default_queue", Offset,		emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_default_queue", Offset,
Args);		Args);
emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_completion_action", Offset,		emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_completion_action", Offset,
Args);		Args);
} else		} else
Offset += 16; // Skipped.		Offset += 16; // Skipped.
Show All 17 Lines

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h

Show First 20 Lines • Show All 416 Lines • ▼ Show 20 Lines	private:
// Compute directly in sgpr[0:1]		// Compute directly in sgpr[0:1]
// Other shaders indirect 64-bits at sgpr[0:1]		// Other shaders indirect 64-bits at sgpr[0:1]
bool ImplicitBufferPtr : 1;		bool ImplicitBufferPtr : 1;

// Pointer to where the ABI inserts special kernel arguments separate from the		// Pointer to where the ABI inserts special kernel arguments separate from the
// user arguments. This is an offset from the KernargSegmentPtr.		// user arguments. This is an offset from the KernargSegmentPtr.
bool ImplicitArgPtr : 1;		bool ImplicitArgPtr : 1;
bool HostcallPtr : 1;		bool HostcallPtr : 1;
		bool HeapPtr : 1;

bool MayNeedAGPRs : 1;		bool MayNeedAGPRs : 1;

// The hard-wired high half of the address of the global information table		// The hard-wired high half of the address of the global information table
// for AMDPAL OS type. 0xffffffff represents no hard-wired high half, since		// for AMDPAL OS type. 0xffffffff represents no hard-wired high half, since
// current hardware only allows a 16 bit value.		// current hardware only allows a 16 bit value.
unsigned GITPtrHigh;		unsigned GITPtrHigh;

▲ Show 20 Lines • Show All 263 Lines • ▼ Show 20 Lines	public:
bool hasImplicitArgPtr() const {		bool hasImplicitArgPtr() const {
return ImplicitArgPtr;		return ImplicitArgPtr;
}		}

bool hasHostcallPtr() const {		bool hasHostcallPtr() const {
return HostcallPtr;		return HostcallPtr;
}		}

		bool hasHeapPtr () const {
		return HeapPtr;
		}

bool hasImplicitBufferPtr() const {		bool hasImplicitBufferPtr() const {
return ImplicitBufferPtr;		return ImplicitBufferPtr;
}		}

AMDGPUFunctionArgInfo &getArgInfo() {		AMDGPUFunctionArgInfo &getArgInfo() {
return ArgInfo;		return ArgInfo;
}		}

▲ Show 20 Lines • Show All 270 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp

Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	: AMDGPUMachineFunction(MF),
WorkGroupInfo(false),		WorkGroupInfo(false),
PrivateSegmentWaveByteOffset(false),		PrivateSegmentWaveByteOffset(false),
WorkItemIDX(false),		WorkItemIDX(false),
WorkItemIDY(false),		WorkItemIDY(false),
WorkItemIDZ(false),		WorkItemIDZ(false),
ImplicitBufferPtr(false),		ImplicitBufferPtr(false),
ImplicitArgPtr(false),		ImplicitArgPtr(false),
HostcallPtr(false),		HostcallPtr(false),
		HeapPtr(false),
GITPtrHigh(0xffffffff),		GITPtrHigh(0xffffffff),
HighBitsOf32BitAddress(0),		HighBitsOf32BitAddress(0),
GDSSize(0) {		GDSSize(0) {
const GCNSubtarget &ST = MF.getSubtarget<GCNSubtarget>();		const GCNSubtarget &ST = MF.getSubtarget<GCNSubtarget>();
const Function &F = MF.getFunction();		const Function &F = MF.getFunction();
FlatWorkGroupSizes = ST.getFlatWorkGroupSizes(F);		FlatWorkGroupSizes = ST.getFlatWorkGroupSizes(F);
WavesPerEU = ST.getWavesPerEU(F);		WavesPerEU = ST.getWavesPerEU(F);

▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	if (!AMDGPU::isGraphics(CC)) {
if (!F.hasFnAttribute("amdgpu-no-queue-ptr"))		if (!F.hasFnAttribute("amdgpu-no-queue-ptr"))
QueuePtr = true;		QueuePtr = true;

if (!F.hasFnAttribute("amdgpu-no-dispatch-id"))		if (!F.hasFnAttribute("amdgpu-no-dispatch-id"))
DispatchID = true;		DispatchID = true;

if (!F.hasFnAttribute("amdgpu-no-hostcall-ptr"))		if (!F.hasFnAttribute("amdgpu-no-hostcall-ptr"))
HostcallPtr = true;		HostcallPtr = true;

		if (!F.hasFnAttribute("amdgpu-no-heap-ptr"))
		HeapPtr = true;
}		}

// FIXME: This attribute is a hack, we just need an analysis on the function		// FIXME: This attribute is a hack, we just need an analysis on the function
// to look for allocas.		// to look for allocas.
bool HasStackObjects = F.hasFnAttribute("amdgpu-stack-objects");		bool HasStackObjects = F.hasFnAttribute("amdgpu-stack-objects");

// TODO: This could be refined a lot. The attribute is a poor way of		// TODO: This could be refined a lot. The attribute is a poor way of
// detecting calls or stack objects that may require it before argument		// detecting calls or stack objects that may require it before argument
▲ Show 20 Lines • Show All 536 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

	Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion3(const MCSubtargetInfo *STI);			bool isHsaAbiVersion3(const MCSubtargetInfo *STI);
	/// \returns True if HSA OS ABI Version identification is 4,			/// \returns True if HSA OS ABI Version identification is 4,
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion4(const MCSubtargetInfo *STI);			bool isHsaAbiVersion4(const MCSubtargetInfo *STI);
	/// \returns True if HSA OS ABI Version identification is 5,			/// \returns True if HSA OS ABI Version identification is 5,
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion5(const MCSubtargetInfo *STI);			bool isHsaAbiVersion5(const MCSubtargetInfo *STI);
	/// \returns True if HSA OS ABI Version identification is 3 or 4,			/// \returns True if HSA OS ABI Version identification is 3 and above,
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI);			bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI);

	/// \returns The offset of the hostcall pointer argument from implicitarg_ptr			/// \returns The offset of the hostcall pointer argument from implicitarg_ptr
	sameerdsUnsubmitted Not Done Reply Inline Actions The first letter should be capital. `\returns` is not the beginning of the sentence ... doxygen will render only the stuff that follows the keyword. sameerds: The first letter should be capital. `\returns` is not the beginning of the sentence ... doxygen…
	cfangAuthorUnsubmitted Done Reply Inline Actions I see. Thanks for pointing out. cfang: I see. Thanks for pointing out.
	unsigned getHostcallImplicitArgPosition();			unsigned getHostcallImplicitArgPosition();

				/// \returns The offset of the heap ptr argument from implicitarg_ptr
				unsigned getHeapPtrImplicitArgPosition();

				/// \returns Code object version.
				unsigned getAmdhsaCodeObjectVersion();

	struct GcnBufferFormatInfo {			struct GcnBufferFormatInfo {
	unsigned Format;			unsigned Format;
	unsigned BitsPerComp;			unsigned BitsPerComp;
	unsigned NumComponents;			unsigned NumComponents;
	unsigned NumFormat;			unsigned NumFormat;
	unsigned DataFormat;			unsigned DataFormat;
	};			};

	▲ Show 20 Lines • Show All 978 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

Show First 20 Lines • Show All 130 Lines • ▼ Show 20 Lines	bool isHsaAbiVersion5(const MCSubtargetInfo *STI) {
return false;		return false;
}		}

bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI) {		bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI) {
return isHsaAbiVersion3(STI) \|\| isHsaAbiVersion4(STI) \|\|		return isHsaAbiVersion3(STI) \|\| isHsaAbiVersion4(STI) \|\|
isHsaAbiVersion5(STI);		isHsaAbiVersion5(STI);
}		}

		unsigned getAmdhsaCodeObjectVersion() {
		return AmdhsaCodeObjectVersion;
		}

// FIXME: All such magic numbers about the ABI should be in a		// FIXME: All such magic numbers about the ABI should be in a
// central TD file.		// central TD file.
unsigned getHostcallImplicitArgPosition() {		unsigned getHostcallImplicitArgPosition() {
switch (AmdhsaCodeObjectVersion) {		switch (AmdhsaCodeObjectVersion) {
case 2:		case 2:
case 3:		case 3:
case 4:		case 4:
return 24;		return 24;
case 5:		case 5:
return 80;		return 80;
default:		default:
llvm_unreachable("Unexpected code object version");		llvm_unreachable("Unexpected code object version");
return 0;		return 0;
}		}
}		}

		unsigned getHeapPtrImplicitArgPosition() {
		if (AmdhsaCodeObjectVersion == 5)
		return 96;
		llvm_unreachable("hidden_heap is supported only by code object version 5");
		return 0;
		}

#define GET_MIMGBaseOpcodesTable_IMPL		#define GET_MIMGBaseOpcodesTable_IMPL
#define GET_MIMGDimInfoTable_IMPL		#define GET_MIMGDimInfoTable_IMPL
#define GET_MIMGInfoTable_IMPL		#define GET_MIMGInfoTable_IMPL
#define GET_MIMGLZMappingTable_IMPL		#define GET_MIMGLZMappingTable_IMPL
#define GET_MIMGMIPMappingTable_IMPL		#define GET_MIMGMIPMappingTable_IMPL
#define GET_MIMGBiasMappingTable_IMPL		#define GET_MIMGBiasMappingTable_IMPL
#define GET_MIMGOffsetMappingTable_IMPL		#define GET_MIMGOffsetMappingTable_IMPL
#define GET_MIMGG16MappingTable_IMPL		#define GET_MIMGG16MappingTable_IMPL
▲ Show 20 Lines • Show All 1,942 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/addrspacecast-constantexpr.ll

	Show First 20 Lines • Show All 224 Lines • ▼ Show 20 Lines

	attributes #0 = { argmemonly nounwind }			attributes #0 = { argmemonly nounwind }
	attributes #1 = { nounwind }			attributes #1 = { nounwind }
	;.			;.
	; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }			; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }
	; AKF_HSA: attributes #[[ATTR1]] = { nounwind }			; AKF_HSA: attributes #[[ATTR1]] = { nounwind }
	;.			;.
	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }			; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }
	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.			;.

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa-call.ll

	Show First 20 Lines • Show All 931 Lines • ▼ Show 20 Lines
	; AKF_HSA: attributes #[[ATTR1]] = { nounwind "target-cpu"="fiji" }			; AKF_HSA: attributes #[[ATTR1]] = { nounwind "target-cpu"="fiji" }
	; AKF_HSA: attributes #[[ATTR2]] = { nounwind "target-cpu"="gfx900" }			; AKF_HSA: attributes #[[ATTR2]] = { nounwind "target-cpu"="gfx900" }
	; AKF_HSA: attributes #[[ATTR3]] = { nounwind }			; AKF_HSA: attributes #[[ATTR3]] = { nounwind }
	; AKF_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-calls" }			; AKF_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-calls" }
	; AKF_HSA: attributes #[[ATTR5]] = { nounwind sanitize_address }			; AKF_HSA: attributes #[[ATTR5]] = { nounwind sanitize_address }
	; AKF_HSA: attributes #[[ATTR6:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" }			; AKF_HSA: attributes #[[ATTR6:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" }
	;.			;.
	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }			; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR12]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR12]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR13]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR13]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR14]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR14]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR15]] = { nounwind "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR15]] = { nounwind "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR16]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR16]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR17]] = { nounwind sanitize_address "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR17]] = { nounwind sanitize_address "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR18]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR18]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR19:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR19:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR20]] = { nounwind }			; ATTRIBUTOR_HSA: attributes #[[ATTR20]] = { nounwind }
	;.			;.

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa.ll

	Show First 20 Lines • Show All 641 Lines • ▼ Show 20 Lines
	attributes #1 = { nounwind }			attributes #1 = { nounwind }

	;.			;.
	; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }			; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; AKF_HSA: attributes #[[ATTR1]] = { nounwind }			; AKF_HSA: attributes #[[ATTR1]] = { nounwind }
	; AKF_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-stack-objects" }			; AKF_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-stack-objects" }
	;.			;.
	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }			; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.			;.

llvm/test/CodeGen/AMDGPU/annotate-kernel-features.ll

	Show First 20 Lines • Show All 412 Lines • ▼ Show 20 Lines
	; NOHSA: attributes #[[ATTR7]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-item-id-y" "uniform-work-group-size"="false" }			; NOHSA: attributes #[[ATTR7]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-item-id-y" "uniform-work-group-size"="false" }
	; NOHSA: attributes #[[ATTR8]] = { nounwind "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }			; NOHSA: attributes #[[ATTR8]] = { nounwind "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }
	; NOHSA: attributes #[[ATTR9]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-group-id-z" "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }			; NOHSA: attributes #[[ATTR9]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-group-id-z" "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }
	;.			;.
	; AKF_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }			; AKF_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; AKF_CHECK: attributes #[[ATTR1]] = { nounwind }			; AKF_CHECK: attributes #[[ATTR1]] = { nounwind }
	;.			;.
	; ATTRIBUTOR_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }			; ATTRIBUTOR_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }			; ATTRIBUTOR_CHECK: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	;.			;.

llvm/test/CodeGen/AMDGPU/direct-indirect-call.ll

	Show All 29 Lines
	; CHECK-SAME: () #[[ATTR1]] {			; CHECK-SAME: () #[[ATTR1]] {
	; CHECK-NEXT: call void @direct()			; CHECK-NEXT: call void @direct()
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	call void @direct()			call void @direct()
	ret void			ret void
	}			}
	;.			;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }
	;.			;.

llvm/test/CodeGen/AMDGPU/duplicate-attribute-indirect.ll

Show All 36 Lines	;
ret void		ret void
}		}

attributes #0 = { "amdgpu-no-dispatch-id" }		attributes #0 = { "amdgpu-no-dispatch-id" }

;.		;.
; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-no-dispatch-id" "amdgpu-stack-objects" }		; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-no-dispatch-id" "amdgpu-stack-objects" }
;.		;.
; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }		; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "uniform-work-group-size"="false" }		; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "uniform-work-group-size"="false" }
;.		;.

llvm/test/CodeGen/AMDGPU/hsa-metadata-heap-v5.ll

This file was added.

				; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx900 --amdhsa-code-object-version=5 -filetype=obj -o - < %s \| llvm-readelf --notes - \| FileCheck %s
				; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx900 --amdhsa-code-object-version=5 < %s \| FileCheck --check-prefix=CHECK %s

				declare void @function1()

				declare void @function2() #0

				; Function Attrs: noinline
				define void @function3(i8 addrspace(4)* %argptr, i8 addrspace(4)* addrspace(1)* %sink) #2 {
				store i8 addrspace(4)* %argptr, i8 addrspace(4)* addrspace(1)* %sink, align 8
				ret void
				}

				; Function Attrs: noinline
				define void @function4(i64 %arg, i64* %a) #2 {
				store i64 %arg, i64* %a
				ret void
				}

				; Function Attrs: noinline
				define void @function5(i8 addrspace(4)* %ptr, i64* %sink) #2 {
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 64
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %sink
				ret void
				}

				; Function Attrs: nounwind readnone speculatable willreturn
				declare align 4 i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr() #1

				; CHECK: amdhsa.kernels:
				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel10
				define amdgpu_kernel void @test_kernel10(i8* %a) {
				store i8 3, i8* %a, align 1
				ret void
				}

				; Call to an extern function

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel20
				define amdgpu_kernel void @test_kernel20(i8* %a) {
				call void @function1()
				store i8 3, i8* %a, align 1
				ret void
				}

				; Explicit attribute on kernel

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel21
				define amdgpu_kernel void @test_kernel21(i8* %a) #0 {
				call void @function1()
				store i8 3, i8* %a, align 1
				ret void
				}

				; Explicit attribute on extern callee

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel22
				define amdgpu_kernel void @test_kernel22(i8* %a) {
				call void @function2()
				store i8 3, i8* %a, align 1
				ret void
				}

				; Access more bytes than the pointer size

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel30
				define amdgpu_kernel void @test_kernel30(i128* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 88
				%cast = bitcast i8 addrspace(4)* %gep to i128 addrspace(4)*
				%x = load i128, i128 addrspace(4)* %cast
				store i128 %x, i128* %a
				ret void
				}

				; Typical load of heap buffer pointer

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel40
				define amdgpu_kernel void @test_kernel40(i64* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Typical usage, overriden by explicit attribute on kernel

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel41
				define amdgpu_kernel void @test_kernel41(i64* %a) #0 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Access to implicit arg before the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel42
				define amdgpu_kernel void @test_kernel42(i64* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 88
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Access to implicit arg after the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel43
				define amdgpu_kernel void @test_kernel43(i64* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 104
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Accessing a byte just before the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel44
				define amdgpu_kernel void @test_kernel44(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 95
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Accessing a byte inside the heap pointer

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel45
				define amdgpu_kernel void @test_kernel45(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Accessing a byte inside the heap pointer

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel46
				define amdgpu_kernel void @test_kernel46(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 103
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Accessing a byte just after the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel47
				define amdgpu_kernel void @test_kernel47(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 104
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Access with an unknown offset

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel50
				define amdgpu_kernel void @test_kernel50(i8* %a, i32 %b) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 %b
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Multiple geps reaching the heap pointer argument.

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel51
				define amdgpu_kernel void @test_kernel51(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep1 = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 16
				%gep2 = getelementptr inbounds i8, i8 addrspace(4)* %gep1, i64 80
				%x = load i8, i8 addrspace(4)* %gep2, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Multiple geps not reaching the heap pointer argument.

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel52
				define amdgpu_kernel void @test_kernel52(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep1 = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 16
				%gep2 = getelementptr inbounds i8, i8 addrspace(4)* %gep1, i64 16
				%x = load i8, i8 addrspace(4)* %gep2, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Heap pointer used inside a function call

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel60
				define amdgpu_kernel void @test_kernel60(i64* %a) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				call void @function4(i64 %x, i64* %a)
				ret void
				}

				; Heap pointer retrieved inside a function call; chain of geps

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel61
				define amdgpu_kernel void @test_kernel61(i64* %a) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 32
				call void @function5(i8 addrspace(4)* %gep, i64* %a)
				ret void
				}

				; Pointer captured

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel70
				define amdgpu_kernel void @test_kernel70(i8 addrspace(4)* addrspace(1)* %sink) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 42
				store i8 addrspace(4)* %gep, i8 addrspace(4)* addrspace(1)* %sink, align 8
				ret void
				}

				; Pointer captured inside function call

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel71
				define amdgpu_kernel void @test_kernel71(i8 addrspace(4)* addrspace(1)* %sink) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 42
				call void @function3(i8 addrspace(4)* %gep, i8 addrspace(4)* addrspace(1)* %sink)
				ret void
				}

				; Ineffective pointer capture

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel72
				define amdgpu_kernel void @test_kernel72() #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 42
				store i8 addrspace(4)* %gep, i8 addrspace(4)* addrspace(1)* undef, align 8
				ret void
				}

				attributes #0 = { "amdgpu-no-heap-ptr" }
				attributes #1 = { nounwind readnone speculatable willreturn }
				attributes #2 = { noinline }

llvm/test/CodeGen/AMDGPU/hsa-metadata-hidden-args-v5.ll

	Show First 20 Lines • Show All 69 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 104			; CHECK-NEXT: .offset: 104
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_hostcall_buffer			; CHECK-NEXT: .value_kind: hidden_hostcall_buffer
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 112			; CHECK-NEXT: .offset: 112
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_multigrid_sync_arg			; CHECK-NEXT: .value_kind: hidden_multigrid_sync_arg
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
				; CHECK-NEXT: .offset: 120
				; CHECK-NEXT: .size: 8
				; CHECK-NEXT: .value_kind: hidden_heap_v1
				; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 128			; CHECK-NEXT: .offset: 128
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_default_queue			; CHECK-NEXT: .value_kind: hidden_default_queue
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 136			; CHECK-NEXT: .offset: 136
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_completion_action			; CHECK-NEXT: .value_kind: hidden_completion_action
	; GFX8-NEXT: - .offset: 216			; GFX8-NEXT: - .offset: 216
	Show All 35 Lines

llvm/test/CodeGen/AMDGPU/propagate-flat-work-group-size.ll

	Show First 20 Lines • Show All 196 Lines • ▼ Show 20 Lines
	attributes #1 = { "amdgpu-flat-work-group-size"="64,128" }			attributes #1 = { "amdgpu-flat-work-group-size"="64,128" }
	attributes #2 = { "amdgpu-flat-work-group-size"="64,64" }			attributes #2 = { "amdgpu-flat-work-group-size"="64,64" }
	attributes #3 = { "amdgpu-flat-work-group-size"="128,256" }			attributes #3 = { "amdgpu-flat-work-group-size"="128,256" }
	attributes #4 = { "amdgpu-flat-work-group-size"="512,1024" }			attributes #4 = { "amdgpu-flat-work-group-size"="512,1024" }
	attributes #5 = { "amdgpu-flat-work-group-size"="128,512" }			attributes #5 = { "amdgpu-flat-work-group-size"="128,512" }
	attributes #6 = { "amdgpu-flat-work-group-size"="512,512" }			attributes #6 = { "amdgpu-flat-work-group-size"="512,512" }
	attributes #7 = { "amdgpu-flat-work-group-size"="64,256" }			attributes #7 = { "amdgpu-flat-work-group-size"="64,256" }
	;.			;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-flat-work-group-size"="1,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR0]] = { "amdgpu-flat-work-group-size"="1,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-flat-work-group-size"="64,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR1]] = { "amdgpu-flat-work-group-size"="64,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR2]] = { "amdgpu-flat-work-group-size"="128,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR2]] = { "amdgpu-flat-work-group-size"="128,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR3]] = { "amdgpu-flat-work-group-size"="64,64" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR3]] = { "amdgpu-flat-work-group-size"="64,64" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR4]] = { "amdgpu-flat-work-group-size"="128,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR4]] = { "amdgpu-flat-work-group-size"="128,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR5]] = { "amdgpu-flat-work-group-size"="512,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR5]] = { "amdgpu-flat-work-group-size"="512,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR6]] = { "amdgpu-flat-work-group-size"="64,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR6]] = { "amdgpu-flat-work-group-size"="64,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR7]] = { "amdgpu-flat-work-group-size"="128,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR7]] = { "amdgpu-flat-work-group-size"="128,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR8]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR8]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.			;.

llvm/test/CodeGen/AMDGPU/simple-indirect-call.ll

Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	; GFX9-NEXT: s_endpgm
%fp = load void(), void()* %fptr.cast		%fp = load void(), void()* %fptr.cast
call void %fp()		call void %fp()
ret void		ret void
}		}

;.		;.
; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-stack-objects" }		; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-stack-objects" }
;.		;.
; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }		; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }		; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }
;.		;.

llvm/test/CodeGen/AMDGPU/uniform-work-group-attribute-missing.ll

	Show All 25 Lines
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	call void @foo()			call void @foo()
	ret void			ret void
	}			}

	attributes #0 = { "uniform-work-group-size"="true" }			attributes #0 = { "uniform-work-group-size"="true" }
	;.			;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.			;.

llvm/test/CodeGen/AMDGPU/uniform-work-group-multistep.ll

	Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	call void @internal2()			call void @internal2()
	ret void			ret void
	}			}

	attributes #0 = { "uniform-work-group-size"="true" }			attributes #0 = { "uniform-work-group-size"="true" }
	;.			;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }			; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	;.			;.

llvm/test/CodeGen/AMDGPU/uniform-work-group-nested-function-calls.ll

	Show All 35 Lines
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	call void @func2()			call void @func2()
	ret void			ret void
	}			}

	attributes #2 = { "uniform-work-group-size"="true" }			attributes #2 = { "uniform-work-group-size"="true" }
	;.			;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }			; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	;.			;.

llvm/test/CodeGen/AMDGPU/uniform-work-group-prevent-attribute-propagation.ll

	Show All 35 Lines
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	call void @func()			call void @func()
	ret void			ret void
	}			}

	attributes #1 = { "uniform-work-group-size"="true" }			attributes #1 = { "uniform-work-group-size"="true" }
	;.			;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }			; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	;.			;.

llvm/test/CodeGen/AMDGPU/uniform-work-group-recursion-test.ll

Show First 20 Lines • Show All 95 Lines • ▼ Show 20 Lines	;
store i32 %r2, i32 addrspace(1)* %m		store i32 %r2, i32 addrspace(1)* %m
ret void		ret void
}		}

; nounwind and readnone are added to match attributor results.		; nounwind and readnone are added to match attributor results.
attributes #0 = { nounwind readnone }		attributes #0 = { nounwind readnone }
attributes #1 = { "uniform-work-group-size"="true" }		attributes #1 = { "uniform-work-group-size"="true" }
;.		;.
; CHECK: attributes #[[ATTR0]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }		; CHECK: attributes #[[ATTR0]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
; CHECK: attributes #[[ATTR1]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }		; CHECK: attributes #[[ATTR1]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
; CHECK: attributes #[[ATTR2]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }		; CHECK: attributes #[[ATTR2]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
;.		;.

llvm/test/CodeGen/AMDGPU/uniform-work-group-test.ll

	Show First 20 Lines • Show All 55 Lines • ▼ Show 20 Lines
	;			;
	call void @func2()			call void @func2()
	call void @func3()			call void @func3()
	ret void			ret void
	}			}

	attributes #0 = { "uniform-work-group-size"="false" }			attributes #0 = { "uniform-work-group-size"="false" }
	;.			;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }			; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.			;.

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][NFC]: Emit metadata for hidden_heap_v1 kernarg
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 411458

llvm/docs/AMDGPUUsage.rst

llvm/lib/BinaryFormat/AMDGPUMetadataVerifier.cpp

llvm/lib/Target/AMDGPU/AMDGPUAttributes.def

llvm/lib/Target/AMDGPU/AMDGPUAttributor.cpp

llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.cpp

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

llvm/test/CodeGen/AMDGPU/addrspacecast-constantexpr.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa-call.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features.ll

llvm/test/CodeGen/AMDGPU/direct-indirect-call.ll

llvm/test/CodeGen/AMDGPU/duplicate-attribute-indirect.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-heap-v5.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-hidden-args-v5.ll

llvm/test/CodeGen/AMDGPU/propagate-flat-work-group-size.ll

llvm/test/CodeGen/AMDGPU/simple-indirect-call.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-attribute-missing.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-multistep.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-nested-function-calls.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-prevent-attribute-propagation.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-recursion-test.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-test.ll

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][NFC]: Emit metadata for hidden_heap_v1 kernargClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 411458

llvm/docs/AMDGPUUsage.rst

llvm/lib/BinaryFormat/AMDGPUMetadataVerifier.cpp

llvm/lib/Target/AMDGPU/AMDGPUAttributes.def

llvm/lib/Target/AMDGPU/AMDGPUAttributor.cpp

llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.cpp

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

llvm/test/CodeGen/AMDGPU/addrspacecast-constantexpr.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa-call.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features.ll

llvm/test/CodeGen/AMDGPU/direct-indirect-call.ll

llvm/test/CodeGen/AMDGPU/duplicate-attribute-indirect.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-heap-v5.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-hidden-args-v5.ll

llvm/test/CodeGen/AMDGPU/propagate-flat-work-group-size.ll

llvm/test/CodeGen/AMDGPU/simple-indirect-call.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-attribute-missing.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-multistep.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-nested-function-calls.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-prevent-attribute-propagation.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-recursion-test.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-test.ll

[AMDGPU][NFC]: Emit metadata for hidden_heap_v1 kernarg
ClosedPublic