This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Dialect/LLVMIR/
-
mlir/
-
Dialect/
-
LLVMIR/
1/1
NVVMOps.td
-
lib/Target/LLVMIR/Dialect/NVVM/
-
Target/
-
LLVMIR/
-
Dialect/
-
NVVM/
-
NVVMToLLVMIRTranslation.cpp
-
test/Target/LLVMIR/
-
Target/
-
LLVMIR/
-
nvvmir.mlir

Differential D136931

[mlir][nvvm] Introduce performance tuning directives
ClosedPublic

Authored by guraypp on Oct 28 2022, 2:40 AM.

Download Raw Diff

Details

Reviewers

ftynse
bondhugula
nicolasvasilache
dcaballe

Commits

rG3ac17449cf98: [mlir][nvvm] Introduce performance tuning directives

Summary

PTX programming models provides some performance tuning directives; see https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#performance-tuning-directives

The downstream compiler namely ptxas leverages these information for better register allocation or to handle other resource management that improves the performance.

This revision introduce all the kernel based directives to MLIR's NVVM dialect. The list is below

maxnreg			-> 	max register per thread in CTA
maxntid			-> 	max threads per CTA
reqntid			-> 	exact number of threads per CTA
minnctapersm		-> 	min CTA per SM

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

guraypp created this revision.Oct 28 2022, 2:40 AM

Herald added a reviewer: ftynse. · View Herald TranscriptOct 28 2022, 2:40 AM

Herald added a reviewer: bondhugula. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: zero9178, bzcheeseman, mattd and 22 others. · View Herald Transcript

guraypp requested review of this revision.Oct 28 2022, 2:40 AM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptOct 28 2022, 2:40 AM

Herald added a reviewer: dcaballe. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: stephenneuendorffer, nicolasvasilache, jholewinski. · View Herald Transcript

guraypp edited the summary of this revision. (Show Details)Oct 28 2022, 2:40 AM

ftynse added inline comments.Oct 28 2022, 2:45 AM

mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
53–59	Can we add a dialect attribute verifier for these to be of the right type?

Harbormaster completed remote builds in B194871: Diff 471456.Oct 28 2022, 3:22 AM

Add attribute verifiers

guraypp marked an inline comment as done.Oct 28 2022, 4:47 AM

ftynse accepted this revision.Oct 28 2022, 4:56 AM

This revision is now accepted and ready to land.Oct 28 2022, 4:56 AM

Harbormaster completed remote builds in B194898: Diff 471492.Oct 28 2022, 5:01 AM

Closed by commit rG3ac17449cf98: [mlir][nvvm] Introduce performance tuning directives (authored by guraypp). · Explain WhyOct 28 2022, 5:02 AM

This revision was automatically updated to reflect the committed changes.

guraypp added a commit: rG3ac17449cf98: [mlir][nvvm] Introduce performance tuning directives.

Hi Guray,

This change-set seems to break the build:

ld.lld: error: undefined symbol: mlir::extractFromI64ArrayAttr(mlir::Attribute)
>>> referenced by NVVMToLLVMIRTranslation.cpp:142 (/llvm-project/mlir/lib/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.cpp:142)
>>>               CMakeFiles/obj.MLIRNVVMToLLVMIRTranslation.dir/NVVMToLLVMIRTranslation.cpp.o:((anonymous namespace)::NVVMDialectLLVMIRTranslationInterface::amendOperation(mlir::Operation*, mlir::NamedAttribute, mlir::LLVM::ModuleTranslation&) const)
>>> referenced by NVVMToLLVMIRTranslation.cpp:152 (/llvm-project/mlir/lib/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.cpp:152)
>>>               CMakeFiles/obj.MLIRNVVMToLLVMIRTranslation.dir/NVVMToLLVMIRTranslation.cpp.o:((anonymous namespace)::NVVMDialectLLVMIRTranslationInterface::amendOperation(mlir::Operation*, mlir::NamedAttribute, mlir::LLVM::ModuleTranslation&) const)
collect2: error: ld returned 1 exit status

It looks like there is missing dependency on MLIRDialectUtils.

Herald added a subscriber: Moerafaat. · View Herald TranscriptOct 28 2022, 5:44 PM

vzakhari mentioned this in rGb9255099382d: Fix MLIR build after D136931.Oct 28 2022, 5:51 PM

I tried to fix it in https://github.com/llvm/llvm-project/commit/b9255099382d0121e567979e7d5b4ca1328b4a41

In D136931#3893571, @vzakhari wrote:

I tried to fix it in https://github.com/llvm/llvm-project/commit/b9255099382d0121e567979e7d5b4ca1328b4a41

Hi Slava, thanks for fixing it.
I am not sure how did I miss it, I remember the buildbot was fine.

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

LLVMIR/

NVVMOps.td

23 lines

lib/

Target/

LLVMIR/

Dialect/

NVVM/

NVVMToLLVMIRTranslation.cpp

61 lines

test/

Target/

LLVMIR/

nvvmir.mlir

63 lines

Diff 471456

mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td

Show All 28 Lines	def NVVM_Dialect : Dialect {
let cppNamespace = "::mlir::NVVM";		let cppNamespace = "::mlir::NVVM";
let dependentDialects = ["LLVM::LLVMDialect"];		let dependentDialects = ["LLVM::LLVMDialect"];
let hasOperationAttrVerify = 1;		let hasOperationAttrVerify = 1;

let extraClassDeclaration = [{		let extraClassDeclaration = [{
/// Get the name of the attribute used to annotate external kernel		/// Get the name of the attribute used to annotate external kernel
/// functions.		/// functions.
static StringRef getKernelFuncAttrName() { return "nvvm.kernel"; }		static StringRef getKernelFuncAttrName() { return "nvvm.kernel"; }
		/// Get the name of the attribute used to annotate max threads required
		/// per CTA for kernel functions.
		static StringRef getMaxntidAttrName() { return "nvvm.maxntid"; }
		/// Get the name of the metadata names for each dimension
		static StringRef getMaxntidXName() { return "maxntidx"; }
		static StringRef getMaxntidYName() { return "maxntidy"; }
		static StringRef getMaxntidZName() { return "maxntidz"; }

		/// Get the name of the attribute used to annotate exact threads required
		/// per CTA for kernel functions.
		static StringRef getReqntidAttrName() { return "nvvm.reqntid"; }
		/// Get the name of the metadata names for each dimension
		static StringRef getReqntidXName() { return "reqntidx"; }
		static StringRef getReqntidYName() { return "reqntidy"; }
		static StringRef getReqntidZName() { return "reqntidz"; }

		/// Get the name of the attribute used to annotate min CTA required
		/// per SM for kernel functions.
		static StringRef getMinctasmAttrName() { return "nvvm.minctasm"; }

		/// Get the name of the attribute used to annotate max number of
		/// registers that can be allocated per thread.
		static StringRef getMaxnregAttrName() { return "nvvm.maxnreg"; }
		ftynseUnsubmitted Done Reply Inline Actions Can we add a dialect attribute verifier for these to be of the right type? ftynse: Can we add a dialect attribute verifier for these to be of the right type?
}];		}];

let useDefaultAttributePrinterParser = 1;		let useDefaultAttributePrinterParser = 1;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// NVVM op definitions		// NVVM op definitions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 977 Lines • Show Last 20 Lines

mlir/lib/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.cpp

//===- NVVMToLLVMIRTranslation.cpp - Translate NVVM to LLVM IR ------------===//		//===- NVVMToLLVMIRTranslation.cpp - Translate NVVM to LLVM IR ------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file implements a translation between the MLIR NVVM dialect and		// This file implements a translation between the MLIR NVVM dialect and
// LLVM IR.		// LLVM IR.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.h"		#include "mlir/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.h"
#include "mlir/Dialect/LLVMIR/NVVMDialect.h"		#include "mlir/Dialect/LLVMIR/NVVMDialect.h"
		#include "mlir/Dialect/Utils/StaticValueUtils.h"
#include "mlir/IR/Operation.h"		#include "mlir/IR/Operation.h"
#include "mlir/Target/LLVMIR/ModuleTranslation.h"		#include "mlir/Target/LLVMIR/ModuleTranslation.h"

#include "llvm/IR/IRBuilder.h"		#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/IntrinsicsNVPTX.h"		#include "llvm/IR/IntrinsicsNVPTX.h"

using namespace mlir;		using namespace mlir;
using namespace mlir::LLVM;		using namespace mlir::LLVM;
▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	#include "mlir/Dialect/LLVMIR/NVVMConversions.inc"

return failure();		return failure();
}		}

/// Attaches module-level metadata for functions marked as kernels.		/// Attaches module-level metadata for functions marked as kernels.
LogicalResult		LogicalResult
amendOperation(Operation *op, NamedAttribute attribute,		amendOperation(Operation *op, NamedAttribute attribute,
LLVM::ModuleTranslation &moduleTranslation) const final {		LLVM::ModuleTranslation &moduleTranslation) const final {
if (attribute.getName() == NVVM::NVVMDialect::getKernelFuncAttrName()) {
auto func = dyn_cast<LLVM::LLVMFuncOp>(op);		auto func = dyn_cast<LLVM::LLVMFuncOp>(op);
if (!func)		if (!func)
return failure();		return failure();

llvm::LLVMContext &llvmContext = moduleTranslation.getLLVMContext();		llvm::LLVMContext &llvmContext = moduleTranslation.getLLVMContext();
llvm::Function *llvmFunc =		llvm::Function *llvmFunc = moduleTranslation.lookupFunction(func.getName());
moduleTranslation.lookupFunction(func.getName());
		auto generateMetadata = [&](int dim, StringRef name) {
llvm::Metadata *llvmMetadata[] = {		llvm::Metadata *llvmMetadata[] = {
llvm::ValueAsMetadata::get(llvmFunc),		llvm::ValueAsMetadata::get(llvmFunc),
		llvm::MDString::get(llvmContext, name),
		llvm::ValueAsMetadata::get(llvm::ConstantInt::get(
		llvm::Type::getInt32Ty(llvmContext), dim))};
		llvm::MDNode *llvmMetadataNode =
		llvm::MDNode::get(llvmContext, llvmMetadata);
		moduleTranslation.getOrInsertNamedModuleMetadata("nvvm.annotations")
		->addOperand(llvmMetadataNode);
		};
		if (attribute.getName() == NVVM::NVVMDialect::getMaxntidAttrName()) {
		if (!attribute.getValue().dyn_cast<ArrayAttr>())
		return failure();
		SmallVector<int64_t> values =
		extractFromI64ArrayAttr(attribute.getValue());
		if (!values.empty())
		generateMetadata(values[0], NVVM::NVVMDialect::getMaxntidXName());
		if (values.size() > 1)
		generateMetadata(values[1], NVVM::NVVMDialect::getMaxntidYName());
		if (values.size() > 2)
		generateMetadata(values[2], NVVM::NVVMDialect::getMaxntidZName());
		} else if (attribute.getName() == NVVM::NVVMDialect::getReqntidAttrName()) {
		if (!attribute.getValue().dyn_cast<ArrayAttr>())
		return failure();
		SmallVector<int64_t> values =
		extractFromI64ArrayAttr(attribute.getValue());
		if (!values.empty())
		generateMetadata(values[0], NVVM::NVVMDialect::getReqntidXName());
		if (values.size() > 1)
		generateMetadata(values[1], NVVM::NVVMDialect::getReqntidYName());
		if (values.size() > 2)
		generateMetadata(values[2], NVVM::NVVMDialect::getReqntidZName());
		} else if (attribute.getName() ==
		NVVM::NVVMDialect::getMinctasmAttrName()) {
		auto value = attribute.getValue().dyn_cast<IntegerAttr>();
		if (!value)
		return success();
		generateMetadata(value.getInt(), "minctasm");
		} else if (attribute.getName() == NVVM::NVVMDialect::getMaxnregAttrName()) {
		auto value = attribute.getValue().dyn_cast<IntegerAttr>();
		if (!value)
		return success();
		generateMetadata(value.getInt(), "maxnreg");
		} else if (attribute.getName() ==
		NVVM::NVVMDialect::getKernelFuncAttrName()) {
		llvm::Metadata *llvmMetadataKernel[] = {
		llvm::ValueAsMetadata::get(llvmFunc),
llvm::MDString::get(llvmContext, "kernel"),		llvm::MDString::get(llvmContext, "kernel"),
llvm::ValueAsMetadata::get(		llvm::ValueAsMetadata::get(
llvm::ConstantInt::get(llvm::Type::getInt32Ty(llvmContext), 1))};		llvm::ConstantInt::get(llvm::Type::getInt32Ty(llvmContext), 1))};
llvm::MDNode *llvmMetadataNode =		llvm::MDNode *llvmMetadataNode =
llvm::MDNode::get(llvmContext, llvmMetadata);		llvm::MDNode::get(llvmContext, llvmMetadataKernel);
moduleTranslation.getOrInsertNamedModuleMetadata("nvvm.annotations")		moduleTranslation.getOrInsertNamedModuleMetadata("nvvm.annotations")
->addOperand(llvmMetadataNode);		->addOperand(llvmMetadataNode);
}		}
return success();		return success();
}		}
};		};
} // namespace		} // namespace

Show All 12 Lines

mlir/test/Target/LLVMIR/nvvmir.mlir

	// RUN: mlir-translate -mlir-to-llvmir %s \| FileCheck %s			// RUN: mlir-translate -mlir-to-llvmir %s -split-input-file \| FileCheck %s

	// CHECK-LABEL: @nvvm_special_regs			// CHECK-LABEL: @nvvm_special_regs
	llvm.func @nvvm_special_regs() -> i32 {			llvm.func @nvvm_special_regs() -> i32 {
	// CHECK: %1 = call i32 @llvm.nvvm.read.ptx.sreg.tid.x()			// CHECK: %1 = call i32 @llvm.nvvm.read.ptx.sreg.tid.x()
	%1 = nvvm.read.ptx.sreg.tid.x : i32			%1 = nvvm.read.ptx.sreg.tid.x : i32
	// CHECK: call i32 @llvm.nvvm.read.ptx.sreg.tid.y()			// CHECK: call i32 @llvm.nvvm.read.ptx.sreg.tid.y()
	%2 = nvvm.read.ptx.sreg.tid.y : i32			%2 = nvvm.read.ptx.sreg.tid.y : i32
	// CHECK: call i32 @llvm.nvvm.read.ptx.sreg.tid.z()			// CHECK: call i32 @llvm.nvvm.read.ptx.sreg.tid.z()
	▲ Show 20 Lines • Show All 334 Lines • ▼ Show 20 Lines
	// NVVM annotations after conversion.			// NVVM annotations after conversion.
	llvm.func @kernel_func() attributes {nvvm.kernel} {			llvm.func @kernel_func() attributes {nvvm.kernel} {
	llvm.return			llvm.return
	}			}

	// CHECK: !nvvm.annotations =			// CHECK: !nvvm.annotations =
	// CHECK-NOT: {ptr @nvvm_special_regs, !"kernel", i32 1}			// CHECK-NOT: {ptr @nvvm_special_regs, !"kernel", i32 1}
	// CHECK: {ptr @kernel_func, !"kernel", i32 1}			// CHECK: {ptr @kernel_func, !"kernel", i32 1}

				// -----

				llvm.func @kernel_func() attributes {nvvm.kernel, nvvm.maxntid = [1,23,32]} {
				llvm.return
				}

				// CHECK: !nvvm.annotations =
				// CHECK-NOT: {ptr @nvvm_special_regs, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"maxntidx", i32 1}
				// CHECK: {ptr @kernel_func, !"maxntidy", i32 23}
				// CHECK: {ptr @kernel_func, !"maxntidz", i32 32}
				// -----

				llvm.func @kernel_func() attributes {nvvm.kernel, nvvm.reqntid = [1,23,32]} {
				llvm.return
				}

				// CHECK: !nvvm.annotations =
				// CHECK-NOT: {ptr @nvvm_special_regs, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"reqntidx", i32 1}
				// CHECK: {ptr @kernel_func, !"reqntidy", i32 23}
				// CHECK: {ptr @kernel_func, !"reqntidz", i32 32}
				// -----

				llvm.func @kernel_func() attributes {nvvm.kernel, nvvm.minctasm = 16} {
				llvm.return
				}

				// CHECK: !nvvm.annotations =
				// CHECK-NOT: {ptr @nvvm_special_regs, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"minctasm", i32 16}
				// -----

				llvm.func @kernel_func() attributes {nvvm.kernel, nvvm.maxnreg = 16} {
				llvm.return
				}

				// CHECK: !nvvm.annotations =
				// CHECK-NOT: {ptr @nvvm_special_regs, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"maxnreg", i32 16}
				// -----

				llvm.func @kernel_func() attributes {nvvm.kernel, nvvm.maxntid = [1,23,32],
				nvvm.minctasm = 16, nvvm.maxnreg = 32} {
				llvm.return
				}

				// CHECK: !nvvm.annotations =
				// CHECK-NOT: {ptr @nvvm_special_regs, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"kernel", i32 1}
				// CHECK: {ptr @kernel_func, !"maxnreg", i32 32}
				// CHECK: {ptr @kernel_func, !"maxntidx", i32 1}
				// CHECK: {ptr @kernel_func, !"maxntidy", i32 23}
				// CHECK: {ptr @kernel_func, !"maxntidz", i32 32}
				// CHECK: {ptr @kernel_func, !"minctasm", i32 16}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][nvvm] Introduce performance tuning directivesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 471456

mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td

mlir/lib/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.cpp

mlir/test/Target/LLVMIR/nvvmir.mlir

[mlir][nvvm] Introduce performance tuning directives
ClosedPublic