Diff 410338

llvm/docs/AMDGPUUsage.rst

	Show All 24 Lines
	"amdgpu-no-queue-ptr" Similar to amdgpu-no-workitem-id-x, except for the			"amdgpu-no-queue-ptr" Similar to amdgpu-no-workitem-id-x, except for the
	llvm.amdgcn.queue.ptr intrinsic. Note that unlike the other ABI hint			llvm.amdgcn.queue.ptr intrinsic. Note that unlike the other ABI hint
	attributes, the queue pointer may be required in situations where the			attributes, the queue pointer may be required in situations where the
	intrinsic call does not directly appear in the program. Some subtargets			intrinsic call does not directly appear in the program. Some subtargets
	require the queue pointer for to handle some addrspacecasts, as well			require the queue pointer for to handle some addrspacecasts, as well
	as the llvm.amdgcn.is.shared, llvm.amdgcn.is.private, llvm.trap, and			as the llvm.amdgcn.is.shared, llvm.amdgcn.is.private, llvm.trap, and
	llvm.debug intrinsics.			llvm.debug intrinsics.

				"amdgpu-no-hostcall-ptr" Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit
				kernel argument that holds the pointer to the hostcall buffer. If this
				attribute is absent, then the amdgpu-no-implicitarg-ptr is also removed.

				"amdgpu-no-heap-ptr" Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit
				kernel argument that holds the pointer to an initialized memory buffer
				that conforms to the requirements of the malloc/free device library V1
				version implementation. If this attribute is absent, then the
				amdgpu-no-implicitarg-ptr is also removed.

	======================================= ==========================================================			======================================= ==========================================================

	.. _amdgpu-elf-code-object:			.. _amdgpu-elf-code-object:

	ELF Code Object			ELF Code Object
	===============			===============

	The AMDGPU backend generates a standard ELF [ELF]_ relocatable code object that			The AMDGPU backend generates a standard ELF [ELF]_ relocatable code object that
	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	of the Z dimension, if it exists. Must be zero if a partial			of the Z dimension, if it exists. Must be zero if a partial
	work group does not exist in the Z dimension.			work group does not exist in the Z dimension.

	"hidden_grid_dims"			"hidden_grid_dims"
	The grid dispatch dimensionality. This is the same value			The grid dispatch dimensionality. This is the same value
	as the AQL dispatch packet dimensionality. Must be a value			as the AQL dispatch packet dimensionality. Must be a value
	between 1 and 3.			between 1 and 3.

				"hidden_heap_v1"
				A global address space pointer to an initialized memory
				buffer that conforms to the requirements of the malloc/free
				device library V1 version implementation.

	"hidden_private_base"			"hidden_private_base"
	The high 32 bits of the flat addressing private aperture base.			The high 32 bits of the flat addressing private aperture base.
	Only used by GFX8 to allow conversion between private segment			Only used by GFX8 to allow conversion between private segment
	and flat addresses. See :ref:`amdgpu-amdhsa-kernel-prolog-flat-scratch`.			and flat addresses. See :ref:`amdgpu-amdhsa-kernel-prolog-flat-scratch`.

	"hidden_shared_base"			"hidden_shared_base"
	The high 32 bits of the flat addressing shared aperture base.			The high 32 bits of the flat addressing shared aperture base.
	Only used by GFX8 to allow conversion between shared segment			Only used by GFX8 to allow conversion between shared segment
	Show All 24 Lines

llvm/lib/BinaryFormat/AMDGPUMetadataVerifier.cpp

Context not available.
	return false;	return false;
	if (!verifyIntegerEntry(ArgsMap, ".size", true))	if (!verifyIntegerEntry(ArgsMap, ".size", true))
	return false;	return false;
	if (!verifyIntegerEntry(ArgsMap, ".offset", true))	if (!verifyIntegerEntry(ArgsMap, ".offset", true))
	return false;	return false;
	if (!verifyScalarEntry(ArgsMap, ".value_kind", true,	if (!verifyScalarEntry(ArgsMap, ".value_kind", true,
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - if (!verifyScalarEntry(ArgsMap, ".value_kind", true, - msgpack::Type::String, + if (!verifyScalarEntry(ArgsMap, ".value_kind", true, msgpack::Type::String, Lint: Pre-merge checks: clang-format: please reformat the code ``` - if (!verifyScalarEntry(ArgsMap, ".value_kind"…
		sameerdsUnsubmitted Not Done Reply Inline Actions Seems to be a whitespace change introduced by clang-format? sameerds: Seems to be a whitespace change introduced by clang-format?
		cfangAuthorUnsubmitted Done Reply Inline Actions Right, a clang-format change. cfang: Right, a clang-format change.
	msgpack::Type::String,	msgpack::Type::String,
	[](msgpack::DocNode &SNode) {	[](msgpack::DocNode &SNode) {
	return StringSwitch<bool>(SNode.getString())	return StringSwitch<bool>(SNode.getString())
	.Case("by_value", true)	.Case("by_value", true)
	.Case("global_buffer", true)	.Case("global_buffer", true)
	.Case("dynamic_shared_pointer", true)	.Case("dynamic_shared_pointer", true)
	.Case("sampler", true)	.Case("sampler", true)
	.Case("image", true)	.Case("image", true)
	Show All 10 Lines
	.Case("hidden_remainder_z", true)	.Case("hidden_remainder_z", true)
	.Case("hidden_global_offset_x", true)	.Case("hidden_global_offset_x", true)
	.Case("hidden_global_offset_y", true)	.Case("hidden_global_offset_y", true)
	.Case("hidden_global_offset_z", true)	.Case("hidden_global_offset_z", true)
	.Case("hidden_grid_dims", true)	.Case("hidden_grid_dims", true)
	.Case("hidden_none", true)	.Case("hidden_none", true)
	.Case("hidden_printf_buffer", true)	.Case("hidden_printf_buffer", true)
	.Case("hidden_hostcall_buffer", true)	.Case("hidden_hostcall_buffer", true)
		.Case("hidden_heap_v1", true)
	.Case("hidden_default_queue", true)	.Case("hidden_default_queue", true)
	.Case("hidden_completion_action", true)	.Case("hidden_completion_action", true)
	.Case("hidden_multigrid_sync_arg", true)	.Case("hidden_multigrid_sync_arg", true)
	.Case("hidden_private_base", true)	.Case("hidden_private_base", true)
	.Case("hidden_shared_base", true)	.Case("hidden_shared_base", true)
	.Case("hidden_queue_ptr", true)	.Case("hidden_queue_ptr", true)
	.Default(false);	.Default(false);
	}))	}))
	Show All 24 Lines

llvm/lib/Target/AMDGPU/AMDGPUAttributes.def

	Show All 13 Lines

	// NOTE: NO INCLUDE GUARD DESIRED!			// NOTE: NO INCLUDE GUARD DESIRED!

	AMDGPU_ATTRIBUTE(DISPATCH_PTR, "amdgpu-no-dispatch-ptr")			AMDGPU_ATTRIBUTE(DISPATCH_PTR, "amdgpu-no-dispatch-ptr")
	AMDGPU_ATTRIBUTE(QUEUE_PTR, "amdgpu-no-queue-ptr")			AMDGPU_ATTRIBUTE(QUEUE_PTR, "amdgpu-no-queue-ptr")
	AMDGPU_ATTRIBUTE(DISPATCH_ID, "amdgpu-no-dispatch-id")			AMDGPU_ATTRIBUTE(DISPATCH_ID, "amdgpu-no-dispatch-id")
	AMDGPU_ATTRIBUTE(IMPLICIT_ARG_PTR, "amdgpu-no-implicitarg-ptr")			AMDGPU_ATTRIBUTE(IMPLICIT_ARG_PTR, "amdgpu-no-implicitarg-ptr")
	AMDGPU_ATTRIBUTE(HOSTCALL_PTR, "amdgpu-no-hostcall-ptr")			AMDGPU_ATTRIBUTE(HOSTCALL_PTR, "amdgpu-no-hostcall-ptr")
				AMDGPU_ATTRIBUTE(HEAP_PTR, "amdgpu-no-heap-ptr")
				sameerdsUnsubmitted Done Reply Inline Actions It seems we need to document these attributes in AMDGPUUsage, which I missed for no-hostcall-ptr. https://llvm.org/docs/AMDGPUUsage.html#llvm-ir-attributes sameerds: It seems we need to document these attributes in AMDGPUUsage, which I missed for no-hostcall…
				cfangAuthorUnsubmitted Done Reply Inline Actions Can you suggest the description for both no-hostcall and no-heap-ptr? Thanks. cfang: Can you suggest the description for both no-hostcall and no-heap-ptr? Thanks.
				sameerdsUnsubmitted Done Reply Inline Actions How about this: "Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit kernel argument that holds the pointer to the hostcall buffer. If this attribute is absent, then the amdgpu-no-implicitarg-ptr is also removed." I do believe the removal already happens, but it will be good to convince ourselves about that. sameerds: How about this: "Similar to amdgpu-no-implicitarg-ptr, except specific to the implicit kernel…
				cfangAuthorUnsubmitted Done Reply Inline Actions Agreed! Thanks for the suggestions. cfang: Agreed! Thanks for the suggestions.
	AMDGPU_ATTRIBUTE(WORKGROUP_ID_X, "amdgpu-no-workgroup-id-x")			AMDGPU_ATTRIBUTE(WORKGROUP_ID_X, "amdgpu-no-workgroup-id-x")
	AMDGPU_ATTRIBUTE(WORKGROUP_ID_Y, "amdgpu-no-workgroup-id-y")			AMDGPU_ATTRIBUTE(WORKGROUP_ID_Y, "amdgpu-no-workgroup-id-y")
	AMDGPU_ATTRIBUTE(WORKGROUP_ID_Z, "amdgpu-no-workgroup-id-z")			AMDGPU_ATTRIBUTE(WORKGROUP_ID_Z, "amdgpu-no-workgroup-id-z")
	AMDGPU_ATTRIBUTE(WORKITEM_ID_X, "amdgpu-no-workitem-id-x")			AMDGPU_ATTRIBUTE(WORKITEM_ID_X, "amdgpu-no-workitem-id-x")
	AMDGPU_ATTRIBUTE(WORKITEM_ID_Y, "amdgpu-no-workitem-id-y")			AMDGPU_ATTRIBUTE(WORKITEM_ID_Y, "amdgpu-no-workitem-id-y")
	AMDGPU_ATTRIBUTE(WORKITEM_ID_Z, "amdgpu-no-workitem-id-z")			AMDGPU_ATTRIBUTE(WORKITEM_ID_Z, "amdgpu-no-workitem-id-z")

	#undef AMDGPU_ATTRIBUTE			#undef AMDGPU_ATTRIBUTE

llvm/lib/Target/AMDGPU/AMDGPUAttributor.cpp

	Show All 24 Lines
	if (!NeedsQueuePtr) {			if (!NeedsQueuePtr) {
	NeedsQueuePtr = checkForQueuePtr(A);			NeedsQueuePtr = checkForQueuePtr(A);
	}			}

	if (NeedsQueuePtr) {			if (NeedsQueuePtr) {
	removeAssumedBits(QUEUE_PTR);			removeAssumedBits(QUEUE_PTR);
	}			}

	if (funcRetrievesHostcallPtr(A)) {			if (funcRetrievesHostcallPtr(A))
	removeAssumedBits(IMPLICIT_ARG_PTR);
	sameerdsUnsubmitted Not Done Reply Inline Actions Why was this removed? Is there some other place which guarantees the same relationship? sameerds: Why was this removed? Is there some other place which guarantees the same relationship?
	cfangAuthorUnsubmitted Done Reply Inline Actions I think the explicit call of amdgcn_implicitarg_ptr will guarantee IMPLICIT_ARG_PTR. cfang: I think the explicit call of amdgcn_implicitarg_ptr will guarantee IMPLICIT_ARG_PTR.
	sameerdsUnsubmitted Not Done Reply Inline Actions That sounds correct. But then this should be asserted somewhere ... probably in the code that manifests the attribute. sameerds: That sounds correct. But then this should be asserted somewhere ... probably in the code that…
	cfangAuthorUnsubmitted Done Reply Inline Actions Do you think the for loop a few lines ahead in the same function should and "MUST" catch amdgcn_implicitarg_ptr? So that we don't need an assert? 386: for (Function Callee : AAEdges.getOptimisticEdges()) { cfang:* Do you think the for loop a few lines ahead in the same function should and "MUST" catch…
	sameerdsUnsubmitted Not Done Reply Inline Actions Maybe it does. But that's not the point. This relationship between the two attributes is pretty important and we should assert. Even the coding standard says so [1] and it is pretty much a life-saver. Maybe I should have asserted myself, but that last change was kinda in a bit of a hurry! [1] https://llvm.org/docs/CodingStandards.html#assert-liberally sameerds: Maybe it does. But that's not the point. This relationship between the two attributes is pretty…
	cfangAuthorUnsubmitted Done Reply Inline Actions Could not cfang: Could not
	removeAssumedBits(HOSTCALL_PTR);			removeAssumedBits(HOSTCALL_PTR);
	}
				if (funcRetrievesHeapPtr(A))
				removeAssumedBits(HEAP_PTR);

	return getAssumed() != OrigAssumed ? ChangeStatus::CHANGED			return getAssumed() != OrigAssumed ? ChangeStatus::CHANGED
	: ChangeStatus::UNCHANGED;			: ChangeStatus::UNCHANGED;
	}			}

	ChangeStatus manifest(Attributor &A) override {			ChangeStatus manifest(Attributor &A) override {
	SmallVector<Attribute, 8> AttrList;			SmallVector<Attribute, 8> AttrList;
	LLVMContext &Ctx = getAssociatedFunction()->getContext();			LLVMContext &Ctx = getAssociatedFunction()->getContext();
	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	}			}
	}			}

	return false;			return false;
	}			}

	bool funcRetrievesHostcallPtr(Attributor &A) {			bool funcRetrievesHostcallPtr(Attributor &A) {
	auto Pos = llvm::AMDGPU::getHostcallImplicitArgPosition();			auto Pos = llvm::AMDGPU::getHostcallImplicitArgPosition();
				AAPointerInfo::OffsetAndSize OAS(Pos, 8);
				return funcRetrievesImplicitKernelArg(A, OAS);
				}

				bool funcRetrievesHeapPtr(Attributor &A) {
				if (AMDGPU::getAmdhsaCodeObjectVersion() != 5)
				return false;
				auto Pos = llvm::AMDGPU::getHeapPtrImplicitArgPosition();
				AAPointerInfo::OffsetAndSize OAS(Pos, 8);
				return funcRetrievesImplicitKerneArg(A, OAS);
				}

				bool funcRetrievesImplicitKernelArg(Attributor &A,
				AAPointerInfo::OffsetAndSize OAS) {
	// Check if this is a call to the implicitarg_ptr builtin and it			// Check if this is a call to the implicitarg_ptr builtin and it
	// is used to retrieve the hostcall pointer. The implicit arg for			// is used to retrieve the hostcall pointer. The implicit arg for
	// hostcall is not used only if every use of the implicitarg_ptr			// hostcall is not used only if every use of the implicitarg_ptr
	// is a load that clearly does not retrieve any byte of the			// is a load that clearly does not retrieve any byte of the
	// hostcall pointer. We check this by tracing all the uses of the			// hostcall pointer. We check this by tracing all the uses of the
	// initial call to the implicitarg_ptr intrinsic.			// initial call to the implicitarg_ptr intrinsic.
				b-sumnerUnsubmitted Done Reply Inline Actions We could potentially be more specific with KernargLoc instead of MemoryLoc. b-sumner: We could potentially be more specific with KernargLoc instead of MemoryLoc.
				sameerdsUnsubmitted Done Reply Inline Actions Or more importantly, the outer function needs a more specific name ... it is checking whether the supplied offset is being accessed from the implicitarg_ptr base. sameerds: Or more importantly, the outer function needs a more specific name ... it is checking whether…
				cfangAuthorUnsubmitted Done Reply Inline Actions Can we use "funcRetrievesImplicitKernarg(...) ? cfang: Can we use "funcRetrievesImplicitKernarg(...) ?
				sameerdsUnsubmitted Done Reply Inline Actions Sounds good to me. But "kernarg" usually refers to the segment ... maybe say "KernelArgument" instead? I am okay with either, since the function has a very limited scope. sameerds: Sounds good to me. But "kernarg" usually refers to the segment ... maybe say "KernelArgument"…
				cfangAuthorUnsubmitted Done Reply Inline Actions Will update as suggested. Thanks. cfang: Will update as suggested. Thanks.
	auto DoesNotLeadToHostcallPtr = [&](Instruction &I) {			auto DoesNotLeadToKernelArgLoc = [&](Instruction &I) {
	auto &Call = cast<CallBase>(I);			auto &Call = cast<CallBase>(I);
	if (Call.getIntrinsicID() != Intrinsic::amdgcn_implicitarg_ptr)			if (Call.getIntrinsicID() != Intrinsic::amdgcn_implicitarg_ptr)
	return true;			return true;

	const auto &PointerInfoAA = A.getAAFor<AAPointerInfo>(			const auto &PointerInfoAA = A.getAAFor<AAPointerInfo>(
	*this, IRPosition::callsite_returned(Call), DepClassTy::REQUIRED);			*this, IRPosition::callsite_returned(Call), DepClassTy::REQUIRED);

	AAPointerInfo::OffsetAndSize OAS(Pos, 8);
	return PointerInfoAA.forallInterferingAccesses(			return PointerInfoAA.forallInterferingAccesses(
	OAS, [](const AAPointerInfo::Access &Acc, bool IsExact) {			OAS, [](const AAPointerInfo::Access &Acc, bool IsExact) {
	return Acc.getRemoteInst()->isDroppable();			return Acc.getRemoteInst()->isDroppable();
	});			});
	};			};

	bool UsedAssumedInformation = false;			bool UsedAssumedInformation = false;
	return !A.checkForAllCallLikeInstructions(DoesNotLeadToHostcallPtr, *this,			return !A.checkForAllCallLikeInstructions(DoesNotLeadToKernelArgLoc, *this,
	UsedAssumedInformation);			UsedAssumedInformation);
	}			}
	};			};

	AAAMDAttributes &AAAMDAttributes::createForPosition(const IRPosition &IRP,			AAAMDAttributes &AAAMDAttributes::createForPosition(const IRPosition &IRP,
	Attributor &A) {			Attributor &A) {
	if (IRP.getPositionKind() == IRPosition::IRP_FUNCTION)			if (IRP.getPositionKind() == IRPosition::IRP_FUNCTION)
	return *new (A.Allocator) AAAMDAttributesFunction(IRP, A);			return *new (A.Allocator) AAAMDAttributesFunction(IRP, A);
	Show All 24 Lines

llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.cpp

	Show All 24 Lines
	emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_hostcall_buffer", Offset,			emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_hostcall_buffer", Offset,
	Args);			Args);
	} else			} else
	Offset += 8; // Skipped.			Offset += 8; // Skipped.

	emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_multigrid_sync_arg", Offset,			emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_multigrid_sync_arg", Offset,
	Args);			Args);

	// Ignore temporarily until it is implemented.			if (MFI.hasHeapPtr())
	// emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_heap_v1", Offset, Args);			emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_heap_v1", Offset, Args);
	Offset += 8;			else
				Offset += 8; // Skipped.

	if (Func.hasFnAttribute("calls-enqueue-kernel")) {			if (Func.hasFnAttribute("calls-enqueue-kernel")) {
	emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_default_queue", Offset,			emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_default_queue", Offset,
	Args);			Args);
	emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_completion_action", Offset,			emitKernelArg(DL, Int8PtrTy, Align(8), "hidden_completion_action", Offset,
	Args);			Args);
	} else			} else
	Offset += 16; // Skipped.			Offset += 16; // Skipped.
	Show All 17 Lines

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h

	Show All 24 Lines
	// Compute directly in sgpr[0:1]			// Compute directly in sgpr[0:1]
	// Other shaders indirect 64-bits at sgpr[0:1]			// Other shaders indirect 64-bits at sgpr[0:1]
	bool ImplicitBufferPtr : 1;			bool ImplicitBufferPtr : 1;

	// Pointer to where the ABI inserts special kernel arguments separate from the			// Pointer to where the ABI inserts special kernel arguments separate from the
	// user arguments. This is an offset from the KernargSegmentPtr.			// user arguments. This is an offset from the KernargSegmentPtr.
	bool ImplicitArgPtr : 1;			bool ImplicitArgPtr : 1;
	bool HostcallPtr : 1;			bool HostcallPtr : 1;
				bool HeapPtr : 1;

	bool MayNeedAGPRs : 1;			bool MayNeedAGPRs : 1;

	// The hard-wired high half of the address of the global information table			// The hard-wired high half of the address of the global information table
	// for AMDPAL OS type. 0xffffffff represents no hard-wired high half, since			// for AMDPAL OS type. 0xffffffff represents no hard-wired high half, since
	// current hardware only allows a 16 bit value.			// current hardware only allows a 16 bit value.
	unsigned GITPtrHigh;			unsigned GITPtrHigh;

	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	bool hasImplicitArgPtr() const {			bool hasImplicitArgPtr() const {
	return ImplicitArgPtr;			return ImplicitArgPtr;
	}			}

	bool hasHostcallPtr() const {			bool hasHostcallPtr() const {
	return HostcallPtr;			return HostcallPtr;
	}			}

				bool hasHeapPtr () const {
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - bool hasHeapPtr () const { - return HeapPtr; - } + bool hasHeapPtr() const { return HeapPtr; } Lint: Pre-merge checks: clang-format: please reformat the code ``` - bool hasHeapPtr () const { - return HeapPtr…
				return HeapPtr;
				}

	bool hasImplicitBufferPtr() const {			bool hasImplicitBufferPtr() const {
	return ImplicitBufferPtr;			return ImplicitBufferPtr;
	}			}

	AMDGPUFunctionArgInfo &getArgInfo() {			AMDGPUFunctionArgInfo &getArgInfo() {
	return ArgInfo;			return ArgInfo;
	}			}

	Show All 24 Lines

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp

Context not available.
	#include "llvm/CodeGen/MachineFunction.h"	#include "llvm/CodeGen/MachineFunction.h"
	#include "llvm/CodeGen/MachineRegisterInfo.h"	#include "llvm/CodeGen/MachineRegisterInfo.h"
	#include "llvm/CodeGen/MIRParser/MIParser.h"	#include "llvm/CodeGen/MIRParser/MIParser.h"
	#include "llvm/IR/CallingConv.h"	#include "llvm/IR/CallingConv.h"
	#include "llvm/IR/DiagnosticInfo.h"	#include "llvm/IR/DiagnosticInfo.h"
	#include "llvm/IR/Function.h"	#include "llvm/IR/Function.h"
	#include <cassert>	#include <cassert>
	#include <vector>	#include <vector>

	#define MAX_LANES 64	#define MAX_LANES 64

	using namespace llvm;	using namespace llvm;

	SIMachineFunctionInfo::SIMachineFunctionInfo(const MachineFunction &MF)	SIMachineFunctionInfo::SIMachineFunctionInfo(const MachineFunction &MF)
	: AMDGPUMachineFunction(MF),	: AMDGPUMachineFunction(MF),
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - : AMDGPUMachineFunction(MF), - PrivateSegmentBuffer(false), - DispatchPtr(false), - QueuePtr(false), - KernargSegmentPtr(false), - DispatchID(false), - FlatScratchInit(false), - WorkGroupIDX(false), - WorkGroupIDY(false), - WorkGroupIDZ(false), 20 diff lines are omitted. See full path. Lint: Pre-merge checks: clang-format: please reformat the code ``` - : AMDGPUMachineFunction(MF)…
	PrivateSegmentBuffer(false),	PrivateSegmentBuffer(false),
	DispatchPtr(false),	DispatchPtr(false),
	QueuePtr(false),	QueuePtr(false),
	KernargSegmentPtr(false),	KernargSegmentPtr(false),
	DispatchID(false),	DispatchID(false),
	FlatScratchInit(false),	FlatScratchInit(false),
	WorkGroupIDX(false),	WorkGroupIDX(false),
	WorkGroupIDY(false),	WorkGroupIDY(false),
	WorkGroupIDZ(false),	WorkGroupIDZ(false),
	WorkGroupInfo(false),	WorkGroupInfo(false),
	PrivateSegmentWaveByteOffset(false),	PrivateSegmentWaveByteOffset(false),
	WorkItemIDX(false),	WorkItemIDX(false),
	WorkItemIDY(false),	WorkItemIDY(false),
	WorkItemIDZ(false),	WorkItemIDZ(false),
	ImplicitBufferPtr(false),	ImplicitBufferPtr(false),
	ImplicitArgPtr(false),	ImplicitArgPtr(false),
	HostcallPtr(false),	HostcallPtr(false),
		HeapPtr(false),
	GITPtrHigh(0xffffffff),	GITPtrHigh(0xffffffff),
	HighBitsOf32BitAddress(0),	HighBitsOf32BitAddress(0),
	GDSSize(0) {	GDSSize(0) {
	const GCNSubtarget &ST = MF.getSubtarget<GCNSubtarget>();	const GCNSubtarget &ST = MF.getSubtarget<GCNSubtarget>();
	const Function &F = MF.getFunction();	const Function &F = MF.getFunction();
	FlatWorkGroupSizes = ST.getFlatWorkGroupSizes(F);	FlatWorkGroupSizes = ST.getFlatWorkGroupSizes(F);
	WavesPerEU = ST.getWavesPerEU(F);	WavesPerEU = ST.getWavesPerEU(F);

	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	if (!F.hasFnAttribute("amdgpu-no-queue-ptr"))	if (!F.hasFnAttribute("amdgpu-no-queue-ptr"))
	QueuePtr = true;	QueuePtr = true;

	if (!F.hasFnAttribute("amdgpu-no-dispatch-id"))	if (!F.hasFnAttribute("amdgpu-no-dispatch-id"))
	DispatchID = true;	DispatchID = true;

	if (!F.hasFnAttribute("amdgpu-no-hostcall-ptr"))	if (!F.hasFnAttribute("amdgpu-no-hostcall-ptr"))
	HostcallPtr = true;	HostcallPtr = true;

		if (!F.hasFnAttribute("amdgpu-no-heap-ptr"))
		HeapPtr = true;
	}	}

	// FIXME: This attribute is a hack, we just need an analysis on the function	// FIXME: This attribute is a hack, we just need an analysis on the function
	// to look for allocas.	// to look for allocas.
	bool HasStackObjects = F.hasFnAttribute("amdgpu-stack-objects");	bool HasStackObjects = F.hasFnAttribute("amdgpu-stack-objects");

	// TODO: This could be refined a lot. The attribute is a poor way of	// TODO: This could be refined a lot. The attribute is a poor way of
	// detecting calls or stack objects that may require it before argument	// detecting calls or stack objects that may require it before argument
	Show All 24 Lines

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

	Show All 24 Lines
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion3(const MCSubtargetInfo *STI);			bool isHsaAbiVersion3(const MCSubtargetInfo *STI);
	/// \returns True if HSA OS ABI Version identification is 4,			/// \returns True if HSA OS ABI Version identification is 4,
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion4(const MCSubtargetInfo *STI);			bool isHsaAbiVersion4(const MCSubtargetInfo *STI);
	/// \returns True if HSA OS ABI Version identification is 5,			/// \returns True if HSA OS ABI Version identification is 5,
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion5(const MCSubtargetInfo *STI);			bool isHsaAbiVersion5(const MCSubtargetInfo *STI);
	/// \returns True if HSA OS ABI Version identification is 3 or 4,			/// \returns True if HSA OS ABI Version identification is 3 and above,
	/// false otherwise.			/// false otherwise.
	bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI);			bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI);

	/// \returns The offset of the hostcall pointer argument from implicitarg_ptr			/// \returns the offset of the hostcall pointer argument from implicitarg_ptr
	sameerdsUnsubmitted Not Done Reply Inline Actions The first letter should be capital. `\returns` is not the beginning of the sentence ... doxygen will render only the stuff that follows the keyword. sameerds: The first letter should be capital. `\returns` is not the beginning of the sentence ... doxygen…
	cfangAuthorUnsubmitted Done Reply Inline Actions I see. Thanks for pointing out. cfang: I see. Thanks for pointing out.
	unsigned getHostcallImplicitArgPosition();			unsigned getHostcallImplicitArgPosition();

				/// \returns the offset of the heap ptr argument from implicitarg_ptr
				unsigned getHeapPtrImplicitArgPosition();

				/// \returns code object version.
				unsigned getAmdhsaCodeObjectVersion();

	struct GcnBufferFormatInfo {			struct GcnBufferFormatInfo {
	unsigned Format;			unsigned Format;
	unsigned BitsPerComp;			unsigned BitsPerComp;
	unsigned NumComponents;			unsigned NumComponents;
	unsigned NumFormat;			unsigned NumFormat;
	unsigned DataFormat;			unsigned DataFormat;
	};			};

	Show All 24 Lines

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

	Show All 24 Lines
	return false;			return false;
	}			}

	bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI) {			bool isHsaAbiVersion3AndAbove(const MCSubtargetInfo *STI) {
	return isHsaAbiVersion3(STI) \|\| isHsaAbiVersion4(STI) \|\|			return isHsaAbiVersion3(STI) \|\| isHsaAbiVersion4(STI) \|\|
	isHsaAbiVersion5(STI);			isHsaAbiVersion5(STI);
	}			}

				unsigned getAmdhsaCodeObjectVersion() {
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code -unsigned getAmdhsaCodeObjectVersion() { - return AmdhsaCodeObjectVersion; -} +unsigned getAmdhsaCodeObjectVersion() { return AmdhsaCodeObjectVersion; } Lint: Pre-merge checks: clang-format: please reformat the code ``` -unsigned getAmdhsaCodeObjectVersion() { - return…
				return AmdhsaCodeObjectVersion;
				}

	// FIXME: All such magic numbers about the ABI should be in a			// FIXME: All such magic numbers about the ABI should be in a
	// central TD file.			// central TD file.
	unsigned getHostcallImplicitArgPosition() {			unsigned getHostcallImplicitArgPosition() {
	switch (AmdhsaCodeObjectVersion) {			switch (AmdhsaCodeObjectVersion) {
	case 2:			case 2:
	case 3:			case 3:
	case 4:			case 4:
	return 24;			return 24;
	case 5:			case 5:
	return 80;			return 80;
	default:			default:
	llvm_unreachable("Unexpected code object version");			llvm_unreachable("Unexpected code object version");
	return 0;			return 0;
	}			}
	}			}

				unsigned getHeapPtrImplicitArgPosition() {
				if (AmdhsaCodeObjectVersion == 5)
				return 96;
				llvm_unreachable("hidden_heap is supported only by code object version 5");
				return 0;
				}

	#define GET_MIMGBaseOpcodesTable_IMPL			#define GET_MIMGBaseOpcodesTable_IMPL
	#define GET_MIMGDimInfoTable_IMPL			#define GET_MIMGDimInfoTable_IMPL
	#define GET_MIMGInfoTable_IMPL			#define GET_MIMGInfoTable_IMPL
	#define GET_MIMGLZMappingTable_IMPL			#define GET_MIMGLZMappingTable_IMPL
	#define GET_MIMGMIPMappingTable_IMPL			#define GET_MIMGMIPMappingTable_IMPL
	#define GET_MIMGBiasMappingTable_IMPL			#define GET_MIMGBiasMappingTable_IMPL
	#define GET_MIMGOffsetMappingTable_IMPL			#define GET_MIMGOffsetMappingTable_IMPL
	#define GET_MIMGG16MappingTable_IMPL			#define GET_MIMGG16MappingTable_IMPL
	Show All 24 Lines

llvm/test/CodeGen/AMDGPU/addrspacecast-constantexpr.ll

	Show All 24 Lines

	attributes #0 = { argmemonly nounwind }	attributes #0 = { argmemonly nounwind }
	attributes #1 = { nounwind }	attributes #1 = { nounwind }
	;.	;.
	; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }	; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }
	; AKF_HSA: attributes #[[ATTR1]] = { nounwind }	; AKF_HSA: attributes #[[ATTR1]] = { nounwind }
	;.	;.
	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { argmemonly nofree nounwind willreturn }
	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa-call.ll

	Show All 24 Lines
	; AKF_HSA: attributes #[[ATTR1]] = { nounwind "target-cpu"="fiji" }	; AKF_HSA: attributes #[[ATTR1]] = { nounwind "target-cpu"="fiji" }
	; AKF_HSA: attributes #[[ATTR2]] = { nounwind "target-cpu"="gfx900" }	; AKF_HSA: attributes #[[ATTR2]] = { nounwind "target-cpu"="gfx900" }
	; AKF_HSA: attributes #[[ATTR3]] = { nounwind }	; AKF_HSA: attributes #[[ATTR3]] = { nounwind }
	; AKF_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-calls" }	; AKF_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-calls" }
	; AKF_HSA: attributes #[[ATTR5]] = { nounwind sanitize_address }	; AKF_HSA: attributes #[[ATTR5]] = { nounwind sanitize_address }
	; AKF_HSA: attributes #[[ATTR6:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" }	; AKF_HSA: attributes #[[ATTR6:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" }
	;.	;.
	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR12]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR12]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR13]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR13]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="gfx900" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR14]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR14]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "target-cpu"="fiji" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR15]] = { nounwind "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR15]] = { nounwind "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR16]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR16]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR17]] = { nounwind sanitize_address "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR17]] = { nounwind sanitize_address "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR18]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR18]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR19:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR19:[0-9]+]] = { nounwind sanitize_address "amdgpu-no-implicitarg-ptr" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR20]] = { nounwind }	; ATTRIBUTOR_HSA: attributes #[[ATTR20]] = { nounwind }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa.ll

	Show All 24 Lines
	attributes #1 = { nounwind }	attributes #1 = { nounwind }

	;.	;.
	; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }	; AKF_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; AKF_HSA: attributes #[[ATTR1]] = { nounwind }	; AKF_HSA: attributes #[[ATTR1]] = { nounwind }
	; AKF_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-stack-objects" }	; AKF_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-stack-objects" }
	;.	;.
	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }	; ATTRIBUTOR_HSA: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR10]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_HSA: attributes #[[ATTR11]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/annotate-kernel-features.ll

	Show All 24 Lines
	; NOHSA: attributes #[[ATTR7]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-item-id-y" "uniform-work-group-size"="false" }	; NOHSA: attributes #[[ATTR7]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-item-id-y" "uniform-work-group-size"="false" }
	; NOHSA: attributes #[[ATTR8]] = { nounwind "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }	; NOHSA: attributes #[[ATTR8]] = { nounwind "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }
	; NOHSA: attributes #[[ATTR9]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-group-id-z" "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }	; NOHSA: attributes #[[ATTR9]] = { nounwind "amdgpu-work-group-id-y" "amdgpu-work-group-id-z" "amdgpu-work-item-id-y" "amdgpu-work-item-id-z" "uniform-work-group-size"="false" }
	;.	;.
	; AKF_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }	; AKF_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; AKF_CHECK: attributes #[[ATTR1]] = { nounwind }	; AKF_CHECK: attributes #[[ATTR1]] = { nounwind }
	;.	;.
	; ATTRIBUTOR_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }	; ATTRIBUTOR_CHECK: attributes #[[ATTR0:[0-9]+]] = { nounwind readnone speculatable willreturn }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR1]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR2]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR3]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR4]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR5]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR6]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR7]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR8]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_CHECK: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }	; ATTRIBUTOR_CHECK: attributes #[[ATTR9]] = { nounwind "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workitem-id-x" "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/direct-indirect-call.ll

	Show All 24 Lines
	; CHECK-SAME: () #[[ATTR1]] {	; CHECK-SAME: () #[[ATTR1]] {
	; CHECK-NEXT: call void @direct()	; CHECK-NEXT: call void @direct()
	; CHECK-NEXT: ret void	; CHECK-NEXT: ret void
	;	;
	call void @direct()	call void @direct()
	ret void	ret void
	}	}
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/duplicate-attribute-indirect.ll

	Show All 24 Lines
	ret void	ret void
	}	}

	attributes #0 = { "amdgpu-no-dispatch-id" }	attributes #0 = { "amdgpu-no-dispatch-id" }

	;.	;.
	; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-no-dispatch-id" "amdgpu-stack-objects" }	; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-no-dispatch-id" "amdgpu-stack-objects" }
	;.	;.
	; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "uniform-work-group-size"="false" }	; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/hsa-metadata-heap-v5.ll

This file was added.

				; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx900 --amdhsa-code-object-version=5 -filetype=obj -o - < %s \| llvm-readelf --notes - \| FileCheck %s
				; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx900 --amdhsa-code-object-version=5 < %s \| FileCheck --check-prefix=CHECK %s

				declare void @function1()

				declare void @function2() #0

				; Function Attrs: noinline
				define void @function3(i8 addrspace(4)* %argptr, i8 addrspace(4)* addrspace(1)* %sink) #2 {
				store i8 addrspace(4)* %argptr, i8 addrspace(4)* addrspace(1)* %sink, align 8
				ret void
				}

				; Function Attrs: noinline
				define void @function4(i64 %arg, i64* %a) #2 {
				store i64 %arg, i64* %a
				ret void
				}

				; Function Attrs: noinline
				define void @function5(i8 addrspace(4)* %ptr, i64* %sink) #2 {
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 64
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %sink
				ret void
				}

				; Function Attrs: nounwind readnone speculatable willreturn
				declare align 4 i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr() #1

				; CHECK: amdhsa.kernels:
				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel10
				define amdgpu_kernel void @test_kernel10(i8* %a) {
				store i8 3, i8* %a, align 1
				ret void
				}

				; Call to an extern function

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel20
				define amdgpu_kernel void @test_kernel20(i8* %a) {
				call void @function1()
				store i8 3, i8* %a, align 1
				ret void
				}

				; Explicit attribute on kernel

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel21
				define amdgpu_kernel void @test_kernel21(i8* %a) #0 {
				call void @function1()
				store i8 3, i8* %a, align 1
				ret void
				}

				; Explicit attribute on extern callee

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel22
				define amdgpu_kernel void @test_kernel22(i8* %a) {
				call void @function2()
				store i8 3, i8* %a, align 1
				ret void
				}

				; Access more bytes than the pointer size

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel30
				define amdgpu_kernel void @test_kernel30(i128* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 88
				%cast = bitcast i8 addrspace(4)* %gep to i128 addrspace(4)*
				%x = load i128, i128 addrspace(4)* %cast
				store i128 %x, i128* %a
				ret void
				}

				; Typical load of heap buffer pointer

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel40
				define amdgpu_kernel void @test_kernel40(i64* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Typical usage, overriden by explicit attribute on kernel

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel41
				define amdgpu_kernel void @test_kernel41(i64* %a) #0 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Access to implicit arg before the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel42
				define amdgpu_kernel void @test_kernel42(i64* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 88
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Access to implicit arg after the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel43
				define amdgpu_kernel void @test_kernel43(i64* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 104
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				store i64 %x, i64* %a
				ret void
				}

				; Accessing a byte just before the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel44
				define amdgpu_kernel void @test_kernel44(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 95
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Accessing a byte inside the heap pointer

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel45
				define amdgpu_kernel void @test_kernel45(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Accessing a byte inside the heap pointer

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel46
				define amdgpu_kernel void @test_kernel46(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 103
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Accessing a byte just after the heap pointer

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel47
				define amdgpu_kernel void @test_kernel47(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 104
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Access with an unknown offset

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel50
				define amdgpu_kernel void @test_kernel50(i8* %a, i32 %b) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 %b
				%x = load i8, i8 addrspace(4)* %gep, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Multiple geps reaching the heap pointer argument.

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel51
				define amdgpu_kernel void @test_kernel51(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep1 = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 16
				%gep2 = getelementptr inbounds i8, i8 addrspace(4)* %gep1, i64 80
				%x = load i8, i8 addrspace(4)* %gep2, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Multiple geps not reaching the heap pointer argument.

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel52
				define amdgpu_kernel void @test_kernel52(i8* %a) {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep1 = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 16
				%gep2 = getelementptr inbounds i8, i8 addrspace(4)* %gep1, i64 16
				%x = load i8, i8 addrspace(4)* %gep2, align 1
				store i8 %x, i8* %a, align 1
				ret void
				}

				; Heap pointer used inside a function call

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel60
				define amdgpu_kernel void @test_kernel60(i64* %a) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 96
				%cast = bitcast i8 addrspace(4)* %gep to i64 addrspace(4)*
				%x = load i64, i64 addrspace(4)* %cast
				call void @function4(i64 %x, i64* %a)
				ret void
				}

				; Heap pointer retrieved inside a function call; chain of geps

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel61
				define amdgpu_kernel void @test_kernel61(i64* %a) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i64 32
				call void @function5(i8 addrspace(4)* %gep, i64* %a)
				ret void
				}

				; Pointer captured

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel70
				define amdgpu_kernel void @test_kernel70(i8 addrspace(4)* addrspace(1)* %sink) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 42
				store i8 addrspace(4)* %gep, i8 addrspace(4)* addrspace(1)* %sink, align 8
				ret void
				}

				; Pointer captured inside function call

				; CHECK: - .args:
				; CHECK: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel71
				define amdgpu_kernel void @test_kernel71(i8 addrspace(4)* addrspace(1)* %sink) #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 42
				call void @function3(i8 addrspace(4)* %gep, i8 addrspace(4)* addrspace(1)* %sink)
				ret void
				}

				; Ineffective pointer capture

				; CHECK: - .args:
				; CHECK-NOT: hidden_heap_v1
				; CHECK-LABEL: .name: test_kernel72
				define amdgpu_kernel void @test_kernel72() #2 {
				%ptr = tail call i8 addrspace(4)* @llvm.amdgcn.implicitarg.ptr()
				%gep = getelementptr inbounds i8, i8 addrspace(4)* %ptr, i32 42
				store i8 addrspace(4)* %gep, i8 addrspace(4)* addrspace(1)* undef, align 8
				ret void
				}

				attributes #0 = { "amdgpu-no-heap-ptr" }
				attributes #1 = { nounwind readnone speculatable willreturn }
				attributes #2 = { noinline }

llvm/test/CodeGen/AMDGPU/hsa-metadata-hidden-args-v5.ll

	Show All 24 Lines
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 104			; CHECK-NEXT: .offset: 104
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_hostcall_buffer			; CHECK-NEXT: .value_kind: hidden_hostcall_buffer
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 112			; CHECK-NEXT: .offset: 112
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_multigrid_sync_arg			; CHECK-NEXT: .value_kind: hidden_multigrid_sync_arg
				; CHECK-NEXT: - .address_space: global
				; CHECK-NEXT: .offset: 120
				; CHECK-NEXT: .size: 8
				; CHECK-NEXT: .value_kind: hidden_heap_v1
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 128			; CHECK-NEXT: .offset: 128
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_default_queue			; CHECK-NEXT: .value_kind: hidden_default_queue
	; CHECK-NEXT: - .address_space: global			; CHECK-NEXT: - .address_space: global
	; CHECK-NEXT: .offset: 136			; CHECK-NEXT: .offset: 136
	; CHECK-NEXT: .size: 8			; CHECK-NEXT: .size: 8
	; CHECK-NEXT: .value_kind: hidden_completion_action			; CHECK-NEXT: .value_kind: hidden_completion_action
	Show All 24 Lines

llvm/test/CodeGen/AMDGPU/propagate-flat-work-group-size.ll

	Show All 24 Lines
	attributes #1 = { "amdgpu-flat-work-group-size"="64,128" }	attributes #1 = { "amdgpu-flat-work-group-size"="64,128" }
	attributes #2 = { "amdgpu-flat-work-group-size"="64,64" }	attributes #2 = { "amdgpu-flat-work-group-size"="64,64" }
	attributes #3 = { "amdgpu-flat-work-group-size"="128,256" }	attributes #3 = { "amdgpu-flat-work-group-size"="128,256" }
	attributes #4 = { "amdgpu-flat-work-group-size"="512,1024" }	attributes #4 = { "amdgpu-flat-work-group-size"="512,1024" }
	attributes #5 = { "amdgpu-flat-work-group-size"="128,512" }	attributes #5 = { "amdgpu-flat-work-group-size"="128,512" }
	attributes #6 = { "amdgpu-flat-work-group-size"="512,512" }	attributes #6 = { "amdgpu-flat-work-group-size"="512,512" }
	attributes #7 = { "amdgpu-flat-work-group-size"="64,256" }	attributes #7 = { "amdgpu-flat-work-group-size"="64,256" }
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-flat-work-group-size"="1,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { "amdgpu-flat-work-group-size"="1,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-flat-work-group-size"="64,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR1]] = { "amdgpu-flat-work-group-size"="64,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR2]] = { "amdgpu-flat-work-group-size"="128,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR2]] = { "amdgpu-flat-work-group-size"="128,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR3]] = { "amdgpu-flat-work-group-size"="64,64" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR3]] = { "amdgpu-flat-work-group-size"="64,64" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR4]] = { "amdgpu-flat-work-group-size"="128,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR4]] = { "amdgpu-flat-work-group-size"="128,128" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR5]] = { "amdgpu-flat-work-group-size"="512,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR5]] = { "amdgpu-flat-work-group-size"="512,512" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR6]] = { "amdgpu-flat-work-group-size"="64,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR6]] = { "amdgpu-flat-work-group-size"="64,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR7]] = { "amdgpu-flat-work-group-size"="128,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR7]] = { "amdgpu-flat-work-group-size"="128,256" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR8]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR8]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/simple-indirect-call.ll

	Show All 24 Lines
	%fp = load void(), void()* %fptr.cast	%fp = load void(), void()* %fptr.cast
	call void %fp()	call void %fp()
	ret void	ret void
	}	}

	;.	;.
	; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-stack-objects" }	; AKF_GCN: attributes #[[ATTR0]] = { "amdgpu-calls" "amdgpu-stack-objects" }
	;.	;.
	; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; ATTRIBUTOR_GCN: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }	; ATTRIBUTOR_GCN: attributes #[[ATTR1]] = { "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/uniform-work-group-attribute-missing.ll

	Show All 24 Lines
	; CHECK-NEXT: ret void	; CHECK-NEXT: ret void
	;	;
	call void @foo()	call void @foo()
	ret void	ret void
	}	}

	attributes #0 = { "uniform-work-group-size"="true" }	attributes #0 = { "uniform-work-group-size"="true" }
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/uniform-work-group-multistep.ll

	Show All 24 Lines
	; CHECK-NEXT: ret void	; CHECK-NEXT: ret void
	;	;
	call void @internal2()	call void @internal2()
	ret void	ret void
	}	}

	attributes #0 = { "uniform-work-group-size"="true" }	attributes #0 = { "uniform-work-group-size"="true" }
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/uniform-work-group-nested-function-calls.ll

	Show All 24 Lines
	; CHECK-NEXT: ret void	; CHECK-NEXT: ret void
	;	;
	call void @func2()	call void @func2()
	ret void	ret void
	}	}

	attributes #2 = { "uniform-work-group-size"="true" }	attributes #2 = { "uniform-work-group-size"="true" }
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/uniform-work-group-prevent-attribute-propagation.ll

	Show All 24 Lines
	; CHECK-NEXT: ret void	; CHECK-NEXT: ret void
	;	;
	call void @func()	call void @func()
	ret void	ret void
	}	}

	attributes #1 = { "uniform-work-group-size"="true" }	attributes #1 = { "uniform-work-group-size"="true" }
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }	; CHECK: attributes #[[ATTR1]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/uniform-work-group-recursion-test.ll

	Show All 24 Lines
	store i32 %r2, i32 addrspace(1)* %m	store i32 %r2, i32 addrspace(1)* %m
	ret void	ret void
	}	}

	; nounwind and readnone are added to match attributor results.	; nounwind and readnone are added to match attributor results.
	attributes #0 = { nounwind readnone }	attributes #0 = { nounwind readnone }
	attributes #1 = { "uniform-work-group-size"="true" }	attributes #1 = { "uniform-work-group-size"="true" }
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	; CHECK: attributes #[[ATTR1]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }	; CHECK: attributes #[[ATTR1]] = { nounwind readnone "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	; CHECK: attributes #[[ATTR2]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }	; CHECK: attributes #[[ATTR2]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="true" }
	;.	;.
Context not available.

llvm/test/CodeGen/AMDGPU/uniform-work-group-test.ll

	Show All 24 Lines
	;	;
	call void @func2()	call void @func2()
	call void @func3()	call void @func3()
	ret void	ret void
	}	}

	attributes #0 = { "uniform-work-group-size"="false" }	attributes #0 = { "uniform-work-group-size"="false" }
	;.	;.
	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }	; CHECK: attributes #[[ATTR0]] = { "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "uniform-work-group-size"="false" }
	;.	;.
Context not available.

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][NFC]: Emit metadata for hidden_heap_v1 kernarg
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 410338

llvm/docs/AMDGPUUsage.rst

llvm/lib/BinaryFormat/AMDGPUMetadataVerifier.cpp

llvm/lib/Target/AMDGPU/AMDGPUAttributes.def

llvm/lib/Target/AMDGPU/AMDGPUAttributor.cpp

llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.cpp

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

llvm/test/CodeGen/AMDGPU/addrspacecast-constantexpr.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa-call.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features.ll

llvm/test/CodeGen/AMDGPU/direct-indirect-call.ll

llvm/test/CodeGen/AMDGPU/duplicate-attribute-indirect.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-heap-v5.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-hidden-args-v5.ll

llvm/test/CodeGen/AMDGPU/propagate-flat-work-group-size.ll

llvm/test/CodeGen/AMDGPU/simple-indirect-call.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-attribute-missing.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-multistep.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-nested-function-calls.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-prevent-attribute-propagation.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-recursion-test.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-test.ll

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][NFC]: Emit metadata for hidden_heap_v1 kernargClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 410338

llvm/docs/AMDGPUUsage.rst

llvm/lib/BinaryFormat/AMDGPUMetadataVerifier.cpp

llvm/lib/Target/AMDGPU/AMDGPUAttributes.def

llvm/lib/Target/AMDGPU/AMDGPUAttributor.cpp

llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.cpp

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h

llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

llvm/test/CodeGen/AMDGPU/addrspacecast-constantexpr.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa-call.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features-hsa.ll

llvm/test/CodeGen/AMDGPU/annotate-kernel-features.ll

llvm/test/CodeGen/AMDGPU/direct-indirect-call.ll

llvm/test/CodeGen/AMDGPU/duplicate-attribute-indirect.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-heap-v5.ll

llvm/test/CodeGen/AMDGPU/hsa-metadata-hidden-args-v5.ll

llvm/test/CodeGen/AMDGPU/propagate-flat-work-group-size.ll

llvm/test/CodeGen/AMDGPU/simple-indirect-call.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-attribute-missing.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-multistep.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-nested-function-calls.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-prevent-attribute-propagation.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-recursion-test.ll

llvm/test/CodeGen/AMDGPU/uniform-work-group-test.ll

[AMDGPU][NFC]: Emit metadata for hidden_heap_v1 kernarg
ClosedPublic