Diff 413637

llvm/docs/AMDGPUUsage.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 4,445 Lines • ▼ Show 20 Lines
..		..

.. table:: compute_pgm_rsrc3 for GFX10		.. table:: compute_pgm_rsrc3 for GFX10
:name: amdgpu-amdhsa-compute_pgm_rsrc3-gfx10-table		:name: amdgpu-amdhsa-compute_pgm_rsrc3-gfx10-table

======= ======= =============================== ===========================================================================		======= ======= =============================== ===========================================================================
Bits Size Field Name Description		Bits Size Field Name Description
======= ======= =============================== ===========================================================================		======= ======= =============================== ===========================================================================
3:0 4 bits SHARED_VGPR_COUNT Number of shared VGPRs for wavefront size 64. Granularity 8. Value 0-120.		3:0 4 bits SHARED_VGPR_COUNT Number of shared VGPR blocks when executing in subvector mode. For
compute_pgm_rsrc1.vgprs + shared_vgpr_cnt cannot exceed 64.		wavefront size 64 the value is 0-15, representing 0-120 VGPRs (granularity
		of 8), such that (compute_pgm_rsrc1.vgprs +1)4 + shared_vgpr_count8 does
		not exceed 256. For wavefront size 32 shared_vgpr_count must be 0.
31:4 28 Reserved, must be 0.		31:4 28 Reserved, must be 0.
		t-tyeUnsubmitted Done Reply Inline Actions Suggest reword to: Number of shared VGPR blocks for wavefront size 64 when executing in subvector mode. For wavefront size 64 the value is 0-15, representing 0-120 VGPRs (granularity of 8), such that (compute_pgm_rsrc1.vgprs +1)4 + shared_vgpr_count8 does not exceed 256. For wavefront size 32 must be 0. t-tye: Suggest reword to: ``` Number of shared VGPR blocks for wavefront size 64 when executing in…
bits		bits
32 Total size 4 bytes.		32 Total size 4 bytes.
======= ===================================================================================================================		======= ===================================================================================================================

..		..

.. table:: Floating Point Rounding Mode Enumeration Values		.. table:: Floating Point Rounding Mode Enumeration Values
:name: amdgpu-amdhsa-floating-point-rounding-mode-enumeration-values-table		:name: amdgpu-amdhsa-floating-point-rounding-mode-enumeration-values-table
▲ Show 20 Lines • Show All 7,902 Lines • ▼ Show 20 Lines	.. table:: AMDHSA Kernel Assembler Directives
``.amdhsa_workgroup_processor_mode`` Target GFX10 Controls ENABLE_WGP_MODE in		``.amdhsa_workgroup_processor_mode`` Target GFX10 Controls ENABLE_WGP_MODE in
Feature :ref:`amdgpu-amdhsa-kernel-descriptor-v3-table`.		Feature :ref:`amdgpu-amdhsa-kernel-descriptor-v3-table`.
Specific		Specific
(cumode)		(cumode)
``.amdhsa_memory_ordered`` 1 GFX10 Controls MEM_ORDERED in		``.amdhsa_memory_ordered`` 1 GFX10 Controls MEM_ORDERED in
:ref:`amdgpu-amdhsa-compute_pgm_rsrc1-gfx6-gfx10-table`.		:ref:`amdgpu-amdhsa-compute_pgm_rsrc1-gfx6-gfx10-table`.
``.amdhsa_forward_progress`` 0 GFX10 Controls FWD_PROGRESS in		``.amdhsa_forward_progress`` 0 GFX10 Controls FWD_PROGRESS in
:ref:`amdgpu-amdhsa-compute_pgm_rsrc1-gfx6-gfx10-table`.		:ref:`amdgpu-amdhsa-compute_pgm_rsrc1-gfx6-gfx10-table`.
		``.amdhsa_shared_vgpr_count`` 0 GFX10 Controls SHARED_VGPR_COUNT in
		rampitecUnsubmitted Done Reply Inline Actions Should probably specify it is GFX10 only. rampitec: Should probably specify it is GFX10 only.
		kzhuravlUnsubmitted Done Reply Inline Actions It already says "GFX10" in one of the columns. kzhuravl: It already says "GFX10" in one of the columns.
		:ref:`amdgpu-amdhsa-compute_pgm_rsrc3-gfx10-table`.
``.amdhsa_exception_fp_ieee_invalid_op`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_IEEE_754_FP_INVALID_OPERATION in		``.amdhsa_exception_fp_ieee_invalid_op`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_IEEE_754_FP_INVALID_OPERATION in
:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.		:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.
``.amdhsa_exception_fp_denorm_src`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_FP_DENORMAL_SOURCE in		``.amdhsa_exception_fp_denorm_src`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_FP_DENORMAL_SOURCE in
:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.		:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.
``.amdhsa_exception_fp_ieee_div_zero`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_IEEE_754_FP_DIVISION_BY_ZERO in		``.amdhsa_exception_fp_ieee_div_zero`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_IEEE_754_FP_DIVISION_BY_ZERO in
:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.		:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.
``.amdhsa_exception_fp_ieee_overflow`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_IEEE_754_FP_OVERFLOW in		``.amdhsa_exception_fp_ieee_overflow`` 0 GFX6-GFX10 Controls ENABLE_EXCEPTION_IEEE_754_FP_OVERFLOW in
:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.		:ref:`amdgpu-amdhsa-compute_pgm_rsrc2-gfx6-gfx10-table`.
▲ Show 20 Lines • Show All 192 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 4,639 Lines • ▼ Show 20 Lines	bool AMDGPUAsmParser::ParseDirectiveAMDHSAKernel() {

StringSet<> Seen;		StringSet<> Seen;

IsaVersion IVersion = getIsaVersion(getSTI().getCPU());		IsaVersion IVersion = getIsaVersion(getSTI().getCPU());

SMRange VGPRRange;		SMRange VGPRRange;
uint64_t NextFreeVGPR = 0;		uint64_t NextFreeVGPR = 0;
uint64_t AccumOffset = 0;		uint64_t AccumOffset = 0;
		uint64_t SharedVGPRCount = 0;
SMRange SGPRRange;		SMRange SGPRRange;
uint64_t NextFreeSGPR = 0;		uint64_t NextFreeSGPR = 0;

// Count the number of user SGPRs implied from the enabled feature bits.		// Count the number of user SGPRs implied from the enabled feature bits.
unsigned ImpliedUserSGPRCount = 0;		unsigned ImpliedUserSGPRCount = 0;

// Track if the asm explicitly contains the directive for the user SGPR		// Track if the asm explicitly contains the directive for the user SGPR
// count.		// count.
▲ Show 20 Lines • Show All 211 Lines • ▼ Show 20 Lines	if (ID == ".amdhsa_group_segment_fixed_size") {
return Error(IDRange.Start, "directive requires gfx10+", IDRange);		return Error(IDRange.Start, "directive requires gfx10+", IDRange);
PARSE_BITS_ENTRY(KD.compute_pgm_rsrc1, COMPUTE_PGM_RSRC1_MEM_ORDERED, Val,		PARSE_BITS_ENTRY(KD.compute_pgm_rsrc1, COMPUTE_PGM_RSRC1_MEM_ORDERED, Val,
ValRange);		ValRange);
} else if (ID == ".amdhsa_forward_progress") {		} else if (ID == ".amdhsa_forward_progress") {
if (IVersion.Major < 10)		if (IVersion.Major < 10)
return Error(IDRange.Start, "directive requires gfx10+", IDRange);		return Error(IDRange.Start, "directive requires gfx10+", IDRange);
PARSE_BITS_ENTRY(KD.compute_pgm_rsrc1, COMPUTE_PGM_RSRC1_FWD_PROGRESS, Val,		PARSE_BITS_ENTRY(KD.compute_pgm_rsrc1, COMPUTE_PGM_RSRC1_FWD_PROGRESS, Val,
ValRange);		ValRange);
		} else if (ID == ".amdhsa_shared_vgpr_count") {
		if (IVersion.Major < 10)
		return Error(IDRange.Start, "directive requires gfx10+", IDRange);
		SharedVGPRCount = Val;
		PARSE_BITS_ENTRY(KD.compute_pgm_rsrc3,
		COMPUTE_PGM_RSRC3_GFX10_SHARED_VGPR_COUNT, Val,
		t-tyeUnsubmitted Done Reply Inline Actions How does the value relate to the total VGPR count? Are these in addition, or this just indicates how many of the VGPRs are shared? If the later, is there an error for when this value exceeds the requested VGPR count? How does new_vgpr work? Is it agnostic to which VGPRs are private and shared? Seems it probably is so that present no issues with adding shared VGPRs but wanted to check. t-tye: How does the value relate to the total VGPR count? Are these in addition, or this just…
		kzhuravlUnsubmitted Done Reply Inline Actions Will post a follow up patch shortly. kzhuravl: Will post a follow up patch shortly.
		lamb-jAuthorUnsubmitted Done Reply Inline Actions I've added a check/error for when shared_vgpr_count exceeds next_free_vgpr. I'm still looking into "new_vgpr" and if any changes need to be made related to that. lamb-j: I've added a check/error for when shared_vgpr_count exceeds next_free_vgpr. I'm still looking…
		ValRange);
} else if (ID == ".amdhsa_exception_fp_ieee_invalid_op") {		} else if (ID == ".amdhsa_exception_fp_ieee_invalid_op") {
PARSE_BITS_ENTRY(		PARSE_BITS_ENTRY(
KD.compute_pgm_rsrc2,		KD.compute_pgm_rsrc2,
COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_IEEE_754_FP_INVALID_OPERATION, Val,		COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_IEEE_754_FP_INVALID_OPERATION, Val,
ValRange);		ValRange);
} else if (ID == ".amdhsa_exception_fp_denorm_src") {		} else if (ID == ".amdhsa_exception_fp_denorm_src") {
PARSE_BITS_ENTRY(KD.compute_pgm_rsrc2,		PARSE_BITS_ENTRY(KD.compute_pgm_rsrc2,
COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_FP_DENORMAL_SOURCE,		COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_FP_DENORMAL_SOURCE,
▲ Show 20 Lines • Show All 73 Lines • ▼ Show 20 Lines	if (AccumOffset < 4 \|\| AccumOffset > 256 \|\| (AccumOffset & 3))
return TokError("accum_offset should be in range [4..256] in "		return TokError("accum_offset should be in range [4..256] in "
"increments of 4");		"increments of 4");
if (AccumOffset > alignTo(std::max((uint64_t)1, NextFreeVGPR), 4))		if (AccumOffset > alignTo(std::max((uint64_t)1, NextFreeVGPR), 4))
return TokError("accum_offset exceeds total VGPR allocation");		return TokError("accum_offset exceeds total VGPR allocation");
AMDHSA_BITS_SET(KD.compute_pgm_rsrc3, COMPUTE_PGM_RSRC3_GFX90A_ACCUM_OFFSET,		AMDHSA_BITS_SET(KD.compute_pgm_rsrc3, COMPUTE_PGM_RSRC3_GFX90A_ACCUM_OFFSET,
(AccumOffset / 4 - 1));		(AccumOffset / 4 - 1));
}		}

		if (IVersion.Major == 10) {
		// SharedVGPRCount < 16 checked by PARSE_ENTRY_BITS
		if (SharedVGPRCount && EnableWavefrontSize32) {
		arsenmUnsubmitted Done Reply Inline Actions Braces arsenm: Braces
		lamb-jAuthorUnsubmitted Done Reply Inline Actions Do we want braces here because the TokError() string is split over two lines? lamb-j: Do we want braces here because the TokError() string is split over two lines?
		return TokError("shared_vgpr_count directive not valid on "
		"wavefront size 32");
		}
		arsenmUnsubmitted Done Reply Inline Actions Leftover debug printing arsenm: Leftover debug printing
		if (SharedVGPRCount * 2 + VGPRBlocks > 63) {
		arsenmUnsubmitted Done Reply Inline Actions Braces, spaces around * arsenm: Braces, spaces around *
		return TokError("shared_vgpr_count*2 + "
		"compute_pgm_rsrc1.GRANULATED_WORKITEM_VGPR_COUNT cannot "
		"exceed 63\n");
		arsenmUnsubmitted Done Reply Inline Actions I don't think this needs the newline arsenm: I don't think this needs the newline
		}
		}

getTargetStreamer().EmitAmdhsaKernelDescriptor(		getTargetStreamer().EmitAmdhsaKernelDescriptor(
getSTI(), KernelName, KD, NextFreeVGPR, NextFreeSGPR, ReserveVCC,		getSTI(), KernelName, KD, NextFreeVGPR, NextFreeSGPR, ReserveVCC,
ReserveFlatScr);		ReserveFlatScr);
return false;		return false;
}		}

bool AMDGPUAsmParser::ParseDirectiveHSACodeObjectVersion() {		bool AMDGPUAsmParser::ParseDirectiveHSACodeObjectVersion() {
uint32_t Major;		uint32_t Major;
▲ Show 20 Lines • Show All 3,509 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp

Show First 20 Lines • Show All 441 Lines • ▼ Show 20 Lines	PRINT_FIELD(OS, ".amdhsa_workgroup_processor_mode", KD,
compute_pgm_rsrc1,		compute_pgm_rsrc1,
amdhsa::COMPUTE_PGM_RSRC1_WGP_MODE);		amdhsa::COMPUTE_PGM_RSRC1_WGP_MODE);
PRINT_FIELD(OS, ".amdhsa_memory_ordered", KD,		PRINT_FIELD(OS, ".amdhsa_memory_ordered", KD,
compute_pgm_rsrc1,		compute_pgm_rsrc1,
amdhsa::COMPUTE_PGM_RSRC1_MEM_ORDERED);		amdhsa::COMPUTE_PGM_RSRC1_MEM_ORDERED);
PRINT_FIELD(OS, ".amdhsa_forward_progress", KD,		PRINT_FIELD(OS, ".amdhsa_forward_progress", KD,
compute_pgm_rsrc1,		compute_pgm_rsrc1,
amdhsa::COMPUTE_PGM_RSRC1_FWD_PROGRESS);		amdhsa::COMPUTE_PGM_RSRC1_FWD_PROGRESS);
		PRINT_FIELD(OS, ".amdhsa_shared_vgpr_count", KD, compute_pgm_rsrc3,
		rampitecUnsubmitted Done Reply Inline Actions Only print it if IVersion.Major >= 10. rampitec: Only print it if IVersion.Major >= 10.
		kzhuravlUnsubmitted Done Reply Inline Actions It is already the case. See line 427 in this patch set. kzhuravl: It is already the case. See line 427 in this patch set.
		amdhsa::COMPUTE_PGM_RSRC3_GFX10_SHARED_VGPR_COUNT);
}		}
PRINT_FIELD(		PRINT_FIELD(
OS, ".amdhsa_exception_fp_ieee_invalid_op", KD,		OS, ".amdhsa_exception_fp_ieee_invalid_op", KD,
compute_pgm_rsrc2,		compute_pgm_rsrc2,
amdhsa::COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_IEEE_754_FP_INVALID_OPERATION);		amdhsa::COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_IEEE_754_FP_INVALID_OPERATION);
PRINT_FIELD(OS, ".amdhsa_exception_fp_denorm_src", KD,		PRINT_FIELD(OS, ".amdhsa_exception_fp_denorm_src", KD,
compute_pgm_rsrc2,		compute_pgm_rsrc2,
amdhsa::COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_FP_DENORMAL_SOURCE);		amdhsa::COMPUTE_PGM_RSRC2_ENABLE_EXCEPTION_FP_DENORMAL_SOURCE);
▲ Show 20 Lines • Show All 434 Lines • Show Last 20 Lines

llvm/test/MC/AMDGPU/hsa-diag-v3.s

	Show First 20 Lines • Show All 219 Lines • ▼ Show 20 Lines
	// NONGFX10: error: directive requires gfx10+			// NONGFX10: error: directive requires gfx10+
	// GFX10: error: value out of range			// GFX10: error: value out of range
	// NONAMDHSA: error: unknown directive			// NONAMDHSA: error: unknown directive
	.warning "test_amdhsa_forward_progress_invalid"			.warning "test_amdhsa_forward_progress_invalid"
	.amdhsa_kernel test_amdhsa_forward_progress_invalid			.amdhsa_kernel test_amdhsa_forward_progress_invalid
	.amdhsa_forward_progress 5			.amdhsa_forward_progress 5
	.end_amdhsa_kernel			.end_amdhsa_kernel

				// GCN-LABEL: warning: test_amdhsa_shared_vgpr_count_invalid1
				// NONGFX10: error: directive requires gfx10+
				// GFX10: error: .amdhsa_next_free_vgpr directive is required
				// NONAMDHSA: error: unknown directive
				.warning "test_amdhsa_shared_vgpr_count_invalid1"
				.amdhsa_kernel test_amdhsa_shared_vgpr_count_invalid1
				.amdhsa_shared_vgpr_count 8
				.end_amdhsa_kernel

				// GCN-LABEL: warning: test_amdhsa_shared_vgpr_count_invalid2
				// NONGFX10: error: directive requires gfx10+
				// GFX10: error: shared_vgpr_count directive not valid on wavefront size 32
				// NONAMDHSA: error: unknown directive
				.warning "test_amdhsa_shared_vgpr_count_invalid2"
				.amdhsa_kernel test_amdhsa_shared_vgpr_count_invalid2
				.amdhsa_next_free_vgpr 16
				.amdhsa_next_free_sgpr 0
				.amdhsa_shared_vgpr_count 8
				.amdhsa_wavefront_size32 1
				.end_amdhsa_kernel

				// GCN-LABEL: warning: test_amdhsa_shared_vgpr_count_invalid3
				// NONGFX10: error: directive requires gfx10+
				// GFX10: error: value out of range
				// NONAMDHSA: error: unknown directive
				.warning "test_amdhsa_shared_vgpr_count_invalid3"
				.amdhsa_kernel test_amdhsa_shared_vgpr_count_invalid3
				.amdhsa_next_free_vgpr 32
				.amdhsa_next_free_sgpr 0
				.amdhsa_shared_vgpr_count 16
				.end_amdhsa_kernel

				// GCN-LABEL: warning: test_amdhsa_shared_vgpr_count_invalid4
				// NONGFX10: error: directive requires gfx10+
				// GFX10: error: shared_vgpr_count*2 + compute_pgm_rsrc1.GRANULATED_WORKITEM_VGPR_COUNT cannot exceed 63
				// NONAMDHSA: error: unknown directive
				.warning "test_amdhsa_shared_vgpr_count_invalid4"
				.amdhsa_kernel test_amdhsa_shared_vgpr_count_invalid4
				.amdhsa_next_free_vgpr 273
				.amdhsa_next_free_sgpr 0
				.amdhsa_shared_vgpr_count 15
				.end_amdhsa_kernel

	// GCN-LABEL: warning: test_next_free_vgpr_invalid			// GCN-LABEL: warning: test_next_free_vgpr_invalid
	// AMDHSA: error: .amdgcn.next_free_{v,s}gpr symbols must be absolute expressions			// AMDHSA: error: .amdgcn.next_free_{v,s}gpr symbols must be absolute expressions
	// NONAMDHSA-NOT: error:			// NONAMDHSA-NOT: error:
	.warning "test_next_free_vgpr_invalid"			.warning "test_next_free_vgpr_invalid"
	.set .amdgcn.next_free_vgpr, "foo"			.set .amdgcn.next_free_vgpr, "foo"
	v_mov_b32_e32 v0, s0			v_mov_b32_e32 v0, s0

	// GCN-LABEL: warning: test_end			// GCN-LABEL: warning: test_end
	.warning "test_end"			.warning "test_end"

llvm/test/MC/AMDGPU/hsa-gfx10-v3.s

	Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
	// ASM: .rodata			// ASM: .rodata

	// Test that only specifying required directives is allowed, and that defaulted			// Test that only specifying required directives is allowed, and that defaulted
	// values are omitted.			// values are omitted.
	.p2align 6			.p2align 6
	.amdhsa_kernel minimal			.amdhsa_kernel minimal
	.amdhsa_next_free_vgpr 0			.amdhsa_next_free_vgpr 0
	.amdhsa_next_free_sgpr 0			.amdhsa_next_free_sgpr 0
				.amdhsa_shared_vgpr_count 0
	.end_amdhsa_kernel			.end_amdhsa_kernel

	// ASM: .amdhsa_kernel minimal			// ASM: .amdhsa_kernel minimal
	// ASM: .amdhsa_next_free_vgpr 0			// ASM: .amdhsa_next_free_vgpr 0
	// ASM-NEXT: .amdhsa_next_free_sgpr 0			// ASM-NEXT: .amdhsa_next_free_sgpr 0
				// ASM: .amdhsa_shared_vgpr_count 0
	// ASM: .end_amdhsa_kernel			// ASM: .end_amdhsa_kernel

	// Test that we can specify all available directives with non-default values.			// Test that we can specify all available directives with non-default values.
	.p2align 6			.p2align 6
	.amdhsa_kernel complete			.amdhsa_kernel complete
	.amdhsa_group_segment_fixed_size 1			.amdhsa_group_segment_fixed_size 1
	.amdhsa_private_segment_fixed_size 1			.amdhsa_private_segment_fixed_size 1
	.amdhsa_kernarg_size 8			.amdhsa_kernarg_size 8
	▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines
	// ASM-NEXT: .amdhsa_float_denorm_mode_32 1			// ASM-NEXT: .amdhsa_float_denorm_mode_32 1
	// ASM-NEXT: .amdhsa_float_denorm_mode_16_64 0			// ASM-NEXT: .amdhsa_float_denorm_mode_16_64 0
	// ASM-NEXT: .amdhsa_dx10_clamp 0			// ASM-NEXT: .amdhsa_dx10_clamp 0
	// ASM-NEXT: .amdhsa_ieee_mode 0			// ASM-NEXT: .amdhsa_ieee_mode 0
	// ASM-NEXT: .amdhsa_fp16_overflow 1			// ASM-NEXT: .amdhsa_fp16_overflow 1
	// ASM-NEXT: .amdhsa_workgroup_processor_mode 1			// ASM-NEXT: .amdhsa_workgroup_processor_mode 1
	// ASM-NEXT: .amdhsa_memory_ordered 1			// ASM-NEXT: .amdhsa_memory_ordered 1
	// ASM-NEXT: .amdhsa_forward_progress 1			// ASM-NEXT: .amdhsa_forward_progress 1
				// ASM-NEXT: .amdhsa_shared_vgpr_count 0
	// ASM-NEXT: .amdhsa_exception_fp_ieee_invalid_op 1			// ASM-NEXT: .amdhsa_exception_fp_ieee_invalid_op 1
	// ASM-NEXT: .amdhsa_exception_fp_denorm_src 1			// ASM-NEXT: .amdhsa_exception_fp_denorm_src 1
	// ASM-NEXT: .amdhsa_exception_fp_ieee_div_zero 1			// ASM-NEXT: .amdhsa_exception_fp_ieee_div_zero 1
	// ASM-NEXT: .amdhsa_exception_fp_ieee_overflow 1			// ASM-NEXT: .amdhsa_exception_fp_ieee_overflow 1
	// ASM-NEXT: .amdhsa_exception_fp_ieee_underflow 1			// ASM-NEXT: .amdhsa_exception_fp_ieee_underflow 1
	// ASM-NEXT: .amdhsa_exception_fp_ieee_inexact 1			// ASM-NEXT: .amdhsa_exception_fp_ieee_inexact 1
	// ASM-NEXT: .amdhsa_exception_int_div_zero 1			// ASM-NEXT: .amdhsa_exception_int_div_zero 1
	// ASM-NEXT: .end_amdhsa_kernel			// ASM-NEXT: .end_amdhsa_kernel
	▲ Show 20 Lines • Show All 60 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Add gfx10 assembler directive to specify shared VGPR count
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 413637

llvm/docs/AMDGPUUsage.rst

llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp

llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp

llvm/test/MC/AMDGPU/hsa-diag-v3.s

llvm/test/MC/AMDGPU/hsa-gfx10-v3.s

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Add gfx10 assembler directive to specify shared VGPR countClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 413637

llvm/docs/AMDGPUUsage.rst

llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp

llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp

llvm/test/MC/AMDGPU/hsa-diag-v3.s

llvm/test/MC/AMDGPU/hsa-gfx10-v3.s

[AMDGPU] Add gfx10 assembler directive to specify shared VGPR count
ClosedPublic