This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Conversion/
-
mlir/
-
Conversion/
-
Passes.td
-
lib/Conversion/GPUCommon/
-
Conversion/
-
GPUCommon/
-
GPUToLLVMConversion.cpp
-
test/Conversion/GPUCommon/
-
Conversion/
-
GPUCommon/
-
transfer_write.mlir

Differential D141987

[mlir] Add "memref::MemRefDialect" as dependentDialects for GpuToLLVMConversionPass
ClosedPublic

Authored by python3kgae on Jan 17 2023, 8:22 PM.

Download Raw Diff

Details

Reviewers

csigg
herhut
ftynse
ThomasRaoux
dcaballe

Commits

rG16f8d17f7bd8: [mlir] Add "memref::MemRefDialect" as dependentDialects for…

Summary

For https://github.com/llvm/llvm-project/issues/60070.
The issue is caused by memref.store is not registed.
Registe it by add "memref::MemRefDialect" as dependetDialects for GpuToLLVMConsersionPass.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

python3kgae created this revision.Jan 17 2023, 8:22 PM

Herald added a reviewer: ftynse. · View Herald TranscriptJan 17 2023, 8:22 PM

Herald added a reviewer: ThomasRaoux. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: Moerafaat, zero9178, bzcheeseman and 21 others. · View Herald Transcript

python3kgae requested review of this revision.Jan 17 2023, 8:22 PM

Herald added a reviewer: dcaballe. · View Herald TranscriptJan 17 2023, 8:22 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Harbormaster completed remote builds in B208400: Diff 490027.Jan 17 2023, 8:54 PM

Do you know why we are generating ops from the memref dialect in gpu to llvm conversion in the first place?

In D141987#4061479, @ftynse wrote:

Do you know why we are generating ops from the memref dialect in gpu to llvm conversion in the first place?

It happens in VectorStoreToMemrefStoreLowering::matchAndRewrite https://github.com/llvm/llvm-project/blob/main/mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp#L2203 where rewriter.replaceOpWithNewOp<memref::StoreOp> is called.

It is added by populateVectorToLLVMConversionPatterns in GpuToLLVMConversionPass::runOnOperation.

But windows build seems go different pattern and not have the dependency. :( So the windows build behavior is the correct path and gpu to llvm conversion don't need memref dialect?

Both VectorStoreToMemrefStoreLowering and VectorLoadStoreConversion<mlir::vector::StoreOp, mlir::vector::StoreOpAdaptor> are added to legalize vector.store.
On windows, VectorLoadStoreConversion has higher benefit, so it does not crash.
And on linux, VectorStoreToMemrefStoreLowering has higher benefit, and memref.store is not resisted, so it will crash.

If patterns added for vector.store are expected to be selected, memref.store should be registered.

Still not sure why windows and linux has different benefit for VectorStoreToMemrefStoreLowering and VectorLoadStoreConversion<mlir::vector::StoreOp, mlir::vector::StoreOpAdaptor>. Should I create an issue for the difference?

Does one of the patterns actually has a higher benefit value? Or do they have the default value and one just happens to be selected instead of the other? In the latter case, it sounds like there is some non-determinism involved (e.g., something stored in a hashmap) that is exposed by platform differences.

In D141987#4064561, @ftynse wrote:

Does one of the patterns actually has a higher benefit value? Or do they have the default value and one just happens to be selected instead of the other? In the latter case, it sounds like there is some non-determinism involved (e.g., something stored in a hashmap) that is exposed by platform differences.

The difference is caused by llvm::array_pod_sort used in

unsigned OperationLegalizer::applyCostModelToPatterns(
       unsigned generatedOpDepth = computeOpLegalizationDepth(
           generatedOp, minOpPatternDepth, legalizerPatterns);
       depth = std::max(depth, generatedOpDepth + 1);
     }

If change it to std::stable_sort, both windows and linux will select VectorStoreToMemrefStoreLowering and crash.

Create https://reviews.llvm.org/D142110 for the non-determinism between windows and linux.

It happens in VectorStoreToMemrefStoreLowering::matchAndRewrite https://github.com/llvm/llvm-project/blob/main/mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp#L2203 where rewriter.replaceOpWithNewOp<memref::StoreOp> is called.

I suppose the same problem is possible with just -convert-vector-to-llvm. This is what happens with uncontrolled addition of patterns :(

ftynse accepted this revision.Jan 20 2023, 6:09 AM

This revision is now accepted and ready to land.Jan 20 2023, 6:09 AM

rebase to get stable_sort fix

Harbormaster completed remote builds in B209007: Diff 490878.Jan 20 2023, 9:58 AM

Closed by commit rG16f8d17f7bd8: [mlir] Add "memref::MemRefDialect" as dependentDialects for… (authored by python3kgae). · Explain WhyJan 20 2023, 11:20 AM

This revision was automatically updated to reflect the committed changes.

python3kgae added a commit: rG16f8d17f7bd8: [mlir] Add "memref::MemRefDialect" as dependentDialects for….

Revision Contents

Path

Size

mlir/

include/

mlir/

Conversion/

Passes.td

5 lines

lib/

Conversion/

GPUCommon/

GPUToLLVMConversion.cpp

1 line

test/

Conversion/

GPUCommon/

transfer_write.mlir

13 lines

Diff 490027

mlir/include/mlir/Conversion/Passes.td

	Show First 20 Lines • Show All 329 Lines • ▼ Show 20 Lines

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// GPUCommon			// GPUCommon
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	def GpuToLLVMConversionPass : Pass<"gpu-to-llvm", "ModuleOp"> {			def GpuToLLVMConversionPass : Pass<"gpu-to-llvm", "ModuleOp"> {
	let summary = "Convert GPU dialect to LLVM dialect with GPU runtime calls";			let summary = "Convert GPU dialect to LLVM dialect with GPU runtime calls";
	let constructor = "mlir::createGpuToLLVMConversionPass()";			let constructor = "mlir::createGpuToLLVMConversionPass()";
	let dependentDialects = ["LLVM::LLVMDialect"];			let dependentDialects = [
				"LLVM::LLVMDialect",
				"memref::MemRefDialect",
				];
	}			}

	def LowerHostCodeToLLVM : Pass<"lower-host-to-llvm", "ModuleOp"> {			def LowerHostCodeToLLVM : Pass<"lower-host-to-llvm", "ModuleOp"> {
	let summary = "Lowers the host module code and `gpu.launch_func` to LLVM";			let summary = "Lowers the host module code and `gpu.launch_func` to LLVM";
	let constructor = "mlir::createLowerHostCodeToLLVMPass()";			let constructor = "mlir::createLowerHostCodeToLLVMPass()";
	let dependentDialects = ["LLVM::LLVMDialect"];			let dependentDialects = ["LLVM::LLVMDialect"];
	}			}

	▲ Show 20 Lines • Show All 659 Lines • Show Last 20 Lines

mlir/lib/Conversion/GPUCommon/GPUToLLVMConversion.cpp

	Show All 22 Lines
	#include "mlir/Conversion/LLVMCommon/ConversionTarget.h"			#include "mlir/Conversion/LLVMCommon/ConversionTarget.h"
	#include "mlir/Conversion/LLVMCommon/Pattern.h"			#include "mlir/Conversion/LLVMCommon/Pattern.h"
	#include "mlir/Conversion/MemRefToLLVM/MemRefToLLVM.h"			#include "mlir/Conversion/MemRefToLLVM/MemRefToLLVM.h"
	#include "mlir/Conversion/VectorToLLVM/ConvertVectorToLLVM.h"			#include "mlir/Conversion/VectorToLLVM/ConvertVectorToLLVM.h"
	#include "mlir/Dialect/Async/IR/Async.h"			#include "mlir/Dialect/Async/IR/Async.h"
	#include "mlir/Dialect/GPU/IR/GPUDialect.h"			#include "mlir/Dialect/GPU/IR/GPUDialect.h"
	#include "mlir/Dialect/GPU/Transforms/Passes.h"			#include "mlir/Dialect/GPU/Transforms/Passes.h"
	#include "mlir/Dialect/LLVMIR/LLVMDialect.h"			#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
				#include "mlir/Dialect/MemRef/IR/MemRef.h"
	#include "mlir/IR/Attributes.h"			#include "mlir/IR/Attributes.h"
	#include "mlir/IR/Builders.h"			#include "mlir/IR/Builders.h"
	#include "mlir/IR/BuiltinOps.h"			#include "mlir/IR/BuiltinOps.h"
	#include "mlir/IR/BuiltinTypes.h"			#include "mlir/IR/BuiltinTypes.h"

	#include "llvm/ADT/STLExtras.h"			#include "llvm/ADT/STLExtras.h"
	#include "llvm/Support/Error.h"			#include "llvm/Support/Error.h"
	#include "llvm/Support/FormatVariadic.h"			#include "llvm/Support/FormatVariadic.h"
	▲ Show 20 Lines • Show All 894 Lines • Show Last 20 Lines

mlir/test/Conversion/GPUCommon/transfer_write.mlir

This file was added.

				// RUN: mlir-opt %s --gpu-to-llvm \| FileCheck %s

				func.func @warp_extract(%arg0: index, %arg1: memref<1024x1024xf32>, %arg2: index, %arg3: vector<1xf32>) {
				%c0 = arith.constant 0 : index
				vector.warp_execute_on_lane_0(%arg0)[32] {
				// CHECK:%[[val:[0-9]+]] = llvm.extractelement
				// CHECK:%[[base:[0-9]+]] = llvm.extractvalue
				// CHECK:%[[ptr:[0-9]+]] = llvm.getelementptr %[[base]]
				// CHECK:llvm.store %[[val]], %[[ptr]]
				vector.transfer_write %arg3, %arg1[%c0, %c0] {in_bounds = [true]} : vector<1xf32>, memref<1024x1024xf32>
				}
				return
				}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Add "memref::MemRefDialect" as dependentDialects for GpuToLLVMConversionPassClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 490027

mlir/include/mlir/Conversion/Passes.td

mlir/lib/Conversion/GPUCommon/GPUToLLVMConversion.cpp

mlir/test/Conversion/GPUCommon/transfer_write.mlir

[mlir] Add "memref::MemRefDialect" as dependentDialects for GpuToLLVMConversionPass
ClosedPublic