This is an archive of the discontinued LLVM Phabricator instance.

[mlir][linalg] Add support for folding pack(fill) into fill.
ClosedPublic

Authored by hanchung on May 3 2023, 4:46 PM.


Event Timeline

hanchung created this revision.May 3 2023, 4:46 PM
Herald added a project: Restricted Project.May 3 2023, 4:46 PM
hanchung requested review of this revision.May 3 2023, 4:46 PM
hanchung updated this revision to Diff 519311.May 3 2023, 5:26 PM

clang-format

qedawkins accepted this revision.May 3 2023, 6:03 PM

Mostly LGTM. Also I have a question about how best to handle patterns for named linalg ops going forward.

mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

Can we forward the new pack op destination to the fill op if the result is a tensor.empty (same as we're doing for generic elementwise in D149250)? This should be fine because the padding value and fill value are the same.
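
For illustration, a rough sketch of what this forwarding could look like in IR (shapes, tile sizes, and value names are made up, not taken from the patch):

  // Without forwarding: the fill's tensor.empty init gets packed into a fresh
  // destination, and the new fill writes into that packed init.
  %packed_init = tensor.pack %empty inner_dims_pos = [0, 1] inner_tiles = [8, 1]
      into %init_dest : tensor<128x256xf32> -> tensor<16x256x8x1xf32>
  %new_fill = linalg.fill ins(%cst : f32)
      outs(%packed_init : tensor<16x256x8x1xf32>) -> tensor<16x256x8x1xf32>

  // With forwarding: reuse the original pack op's destination directly, so the
  // extra pack of the empty tensor is never created.
  %new_fill = linalg.fill ins(%cst : f32)
      outs(%pack_dest : tensor<16x256x8x1xf32>) -> tensor<16x256x8x1xf32>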

492

Once padding is supported (with some toggle option) in propagation through generics, would it be better to let the linalg.fill building happen via a callback specified in BubbleUpPackOpThroughFillOpPattern? Then we could reuse the pattern for generic ops for any linalg op, which would help consolidate code at the expense of extra analysis in certain cases like this.

This revision is now accepted and ready to land.May 3 2023, 6:03 PM
chelini added inline comments.May 4 2023, 1:14 AM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
481

nit: I would write: SmallVector<Value> operands = {fillOp.getDpsInitOperand(0)->get(), packOpDest};

486

nit: operands.append(packOp.getInnerTiles().begin(), packOp.getInnerTiles().end());

mlir/test/Dialect/Linalg/data-layout-propagation.mlir
836

Do we plan to fold the pack into the fill op in another PR? I would avoid the packing and make the fill operation work directly on the packed layout. Do you have some use cases where you want to preserve the packing? If so, we could do this folding in populateSimplifyTensorPack.
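
To make this concrete, a hypothetical before/after (shapes, tile sizes, and value names are only illustrative, not from the patch):

  // Before: fill in the unpacked layout, then pack the result.
  %empty = tensor.empty() : tensor<128x256xf32>
  %fill = linalg.fill ins(%cst : f32) outs(%empty : tensor<128x256xf32>) -> tensor<128x256xf32>
  %pack = tensor.pack %fill inner_dims_pos = [0, 1] inner_tiles = [8, 1]
      into %dest : tensor<128x256xf32> -> tensor<16x256x8x1xf32>

  // After: fill directly in the packed layout; no tensor.pack is left behind.
  %fill = linalg.fill ins(%cst : f32) outs(%dest : tensor<16x256x8x1xf32>) -> tensor<16x256x8x1xf32>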

hanchung marked 4 inline comments as done.May 4 2023, 3:51 PM
hanchung added inline comments.
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

After reading chelini's comment, I think we can drop the pack op directly. I can't think of a case where we'd want to preserve the new pack op, because it's only used by the new fill op. What do you think?

492

Propagation with a padding value is tricky, especially for generic ops. The fill op case is fairly simple, so I'd like to scope it to this pattern for now.

mlir/test/Dialect/Linalg/data-layout-propagation.mlir
836

Sounds really good to me; maybe we can do it in this PR.

hanchung updated this revision to Diff 519685.May 4 2023, 3:51 PM
hanchung marked 2 inline comments as done.

address comments

hanchung retitled this revision from [mlir][linalg] Add support for propagating pack op through linalg.fill to [mlir][linalg] Add support for folding pack(fill) into fill..May 4 2023, 3:52 PM
qedawkins added inline comments.May 4 2023, 4:52 PM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

That's probably fine too, although I think they work out to be essentially the same thing whenever the fill is on an empty tensor (with an extra floating pack that can get DCE'd). Correct me if I'm wrong about that though.

My only concern was if the fill op destination was a non-empty tensor for whatever reason and we would then want to maintain the use-def chain, similar to bubbling up through pure elementwise (as noted in D149250, given that fill is just a specific kind of pure elementwise). Is the difference, then, that filling a tensor is only sensible on an empty tensor? Basically I don't see why a named op gets to be propagated differently than an equivalent generic (without padding). At a minimum I would expect us to be using the LinalgFillOpInterface for this then.

hanchung added inline comments.May 4 2023, 5:55 PM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

> My only concern was if the fill op destination was a non-empty tensor for whatever reason...

It looks fine to me because, in the tensor world, everything is destructive in that sense: filling any tensor (maybe except a bufferization.alloc_tensor op) just creates a new tensor with the fill value, regardless of what the destination held. The pass is intended to be applied at the tensor graph level, which is higher than bufferization, so I think it is fine.

The generic op version of the fill op seems very rare to me. I agree that it could come up, but we don't have to expose it at this moment. I have an idea for supporting padding in limited cases, and it could cover this case.

> At a minimum I would expect us to be using the LinalgFillOpInterface for this then.

I was not aware of the interface. I studied it and found that we can't use it here. The property it guarantees is having a scalar value operand and one output operand, but the computation body can be different; e.g., we could define a FillExpOp using the interface that fills arith.exp(scalar) into the output operand. In that context, we'd need a different check between the padding value and the filled value. The interface is only used by the linalg.fill op now, so I think we can scope the pattern to linalg.fill at this moment. If we really have other needs, it should be easy to extend.
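
For reference, here is roughly what a fill looks like as the named op versus as an equivalent generic (a sketch with made-up shapes); the interface only pins down the scalar-operand/one-output structure, not that the body yields the scalar unchanged:

  %fill = linalg.fill ins(%cst : f32) outs(%empty : tensor<128x256xf32>) -> tensor<128x256xf32>

  // Roughly the same computation spelled as a generic: the body yields the
  // scalar operand unchanged, which is what makes "padding value == fill value"
  // easy to check for linalg.fill.
  %gen = linalg.generic {
      indexing_maps = [affine_map<(d0, d1) -> ()>, affine_map<(d0, d1) -> (d0, d1)>],
      iterator_types = ["parallel", "parallel"]}
      ins(%cst : f32) outs(%empty : tensor<128x256xf32>) {
    ^bb0(%in: f32, %out: f32):
      linalg.yield %in : f32
  } -> tensor<128x256xf32>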

hanchung added inline comments.May 4 2023, 6:00 PM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

I'm open to keeping the pack chain and folding it away only if the dest tensor is a tensor.empty op. There could be other use cases that I haven't seen before.

qedawkins added inline comments.May 4 2023, 6:30 PM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

I see; this is fine then, and I didn't intend to block. I just figured named linalg ops should try to stay as close as possible to the generic patterns (the only difference being the pack chain in this case), but agreed that it would be a strange usage of a fill op that I haven't seen either. I can go either way on keeping the pack chain, because the pattern looks cleaner without it. Thanks for the discussion!

hanchung added inline comments.May 4 2023, 7:52 PM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

I'm happy that we had this nice discussion! (I planned to wait for a day just in case others would like to review it. No worries at all!)

chelini added inline comments.May 5 2023, 1:35 AM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

> That's probably fine too, although I think they work out to be essentially the same thing whenever the fill is on an empty tensor (with an extra floating pack that can get DCE'd). Correct me if I'm wrong about that though.

> My only concern was if the fill op destination was a non-empty tensor for whatever reason and we would then want to maintain the use-def chain, similar to bubbling up through pure elementwise (as noted in D149250, given that fill is just a specific kind of pure elementwise). Is the difference, then, that filling a tensor is only sensible on an empty tensor? Basically I don't see why a named op gets to be propagated differently than an equivalent generic (without padding). At a minimum I would expect us to be using the LinalgFillOpInterface for this then.

Thanks for all these discussions! In D149250, I attempted to avoid packing an 'init' operand if the 'init' is already a tensor.empty. In this case, the empty carries only the shape; we can simply replace it with the packed version. I am starting to wonder if the into keyword in tensor.pack is confusing. Internally there was some discussion about it because people think we are writing into the destination tensor. Let's discuss this offline in a call!
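
As a tiny sketch of that point (hypothetical shapes): since a tensor.empty init only carries a shape, propagation can simply swap it for an empty of the packed type instead of packing it:

  %init = tensor.empty() : tensor<128x256xf32>
  // ...becomes, after propagation, just an empty of the packed type:
  %packed_init = tensor.empty() : tensor<16x256x8x1xf32>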

mlir/test/Dialect/Linalg/data-layout-propagation.mlir
836

Why do we want to pass %dest as an argument for the fill? It seems a bit weird at the tensor level. I would create a tensor.empty inside the func.

852

same here.

qedawkins added inline comments.May 5 2023, 9:07 AM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

> I am starting to wonder if the into keyword in tensor.pack is confusing. Internally there was some discussion about it because people think we are writing into the destination tensor. Let's discuss this offline in a call!

Agreed, that has confused me as well; also happy to discuss in a call!

hanchung marked an inline comment as done.May 5 2023, 11:30 AM
hanchung added inline comments.
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
491–492

Agreed, into is a confusing term. We could maybe use ins and outs, because the operand is intended to capture the shape information. Happy to discuss it!

mlir/test/Dialect/Linalg/data-layout-propagation.mlir
836

It is not a big concern to me because a function can be inlined into a program anyway; my intention was to create a minimal test for the pass. We can do this for static cases, but not for dynamic shapes, where we have to tie the dim sizes to something; passing the destination as an argument is the simplest way I've found. Let's do it for static shapes and leave the dynamic case as it is.
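
A rough sketch of the dynamic-shape case being described (function name, shapes, and tile sizes are illustrative, not the actual test); the packed destination comes in as an argument so the dynamic sizes of the unpacked tensor.empty can be tied to it:

  func.func @fill_pack_dynamic(%dest: tensor<?x?x16x16xf32>) -> tensor<?x?x16x16xf32> {
    %cst = arith.constant 0.000000e+00 : f32
    %c0 = arith.constant 0 : index
    %c1 = arith.constant 1 : index
    %c16 = arith.constant 16 : index
    // Tie the unpacked sizes to the packed destination: outer dim * tile size.
    %d0 = tensor.dim %dest, %c0 : tensor<?x?x16x16xf32>
    %d1 = tensor.dim %dest, %c1 : tensor<?x?x16x16xf32>
    %s0 = arith.muli %d0, %c16 : index
    %s1 = arith.muli %d1, %c16 : index
    %empty = tensor.empty(%s0, %s1) : tensor<?x?xf32>
    %fill = linalg.fill ins(%cst : f32) outs(%empty : tensor<?x?xf32>) -> tensor<?x?xf32>
    %pack = tensor.pack %fill inner_dims_pos = [0, 1] inner_tiles = [16, 16]
        into %dest : tensor<?x?xf32> -> tensor<?x?x16x16xf32>
    return %pack : tensor<?x?x16x16xf32>
  }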

hanchung updated this revision to Diff 519930.May 5 2023, 11:32 AM

update tests

hanchung updated this revision to Diff 519931.May 5 2023, 11:33 AM

update tests

This revision was landed with ongoing or failed builds.May 5 2023, 11:42 AM
This revision was automatically updated to reflect the committed changes.
nicolasvasilache added inline comments.
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
453

I had the impression that @qcolombet had written something like this ~1 month ago?

Signaling in case there is a dedup that needs to happen here.

qcolombet added inline comments.May 11 2023, 2:09 PM
mlir/lib/Dialect/Linalg/Transforms/DataLayoutPropagation.cpp
453

We definitely talked about it, but no, I didn't land anything on that.