This is an archive of the discontinued LLVM Phabricator instance.

Differential D109742

[mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOp results
ClosedPublic

Authored by springerm on Sep 13 2021, 11:49 PM.

Download Raw Diff

Details

Reviewers

nicolasvasilache

Commits

rG934e2f695e18: [mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOp results

Summary

E.g.:

%2 = memref.alloc() {alignment = 128 : i64} : memref<256x256xf32>
%3 = memref.alloc() {alignment = 128 : i64} : memref<256x256xf32>

// ... (%3 is not written to)

linalg.copy(%3, %2) : memref<256x256xf32>, memref<256x256xf32>
vector.transfer_write %11, %2[%c0, %c0] {in_bounds = [true, true]} : vector<256x256xf32>, memref<256x256xf32>

Avoid copies of %3 if %3 came directly from an InitTensorOp.

Depends On D109741

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

springerm created this revision.Sep 13 2021, 11:49 PM

Herald added subscribers: wenzhicui, wrengr, Chia-hungDuan and 20 others. · View Herald TranscriptSep 13 2021, 11:49 PM

springerm requested review of this revision.Sep 13 2021, 11:49 PM

Herald added a project: Restricted Project. · View Herald TranscriptSep 13 2021, 11:49 PM

Herald added subscribers: limo1996, stephenneuendorffer. · View Herald Transcript

Harbormaster completed remote builds in B123800: Diff 372413.Sep 14 2021, 12:37 AM

nicolasvasilache accepted this revision.Sep 15 2021, 12:31 AM

This revision is now accepted and ready to land.Sep 15 2021, 12:31 AM

Closed by commit rG934e2f695e18: [mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOp results (authored by springerm). · Explain WhySep 15 2021, 1:32 AM

This revision was automatically updated to reflect the committed changes.

springerm added a commit: rG934e2f695e18: [mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOp results.

Revision Contents

Path

Size

mlir/

lib/

Dialect/

Linalg/

Transforms/

ComprehensiveBufferize.cpp

3 lines

Diff 372413

mlir/lib/Dialect/Linalg/Transforms/ComprehensiveBufferize.cpp

Show First 20 Lines • Show All 2,184 Lines • ▼ Show 20 Lines	static LogicalResult bufferize(OpBuilder &b, VectorTransferOpInterface op,

// If transfer_write is not inPlace, allocate a new buffer.		// If transfer_write is not inPlace, allocate a new buffer.
Value newInputBuffer;		Value newInputBuffer;
if (inPlace != InPlaceSpec::True) {		if (inPlace != InPlaceSpec::True) {
// Alloc a copy for `writeOp.source()`, it will become the result buffer.		// Alloc a copy for `writeOp.source()`, it will become the result buffer.
newInputBuffer = createNewAllocDeallocPairForShapedValue(		newInputBuffer = createNewAllocDeallocPairForShapedValue(
b, loc, writeOp.source(), aliasInfo);		b, loc, writeOp.source(), aliasInfo);
Value v = lookup(bvm, writeOp.source());		Value v = lookup(bvm, writeOp.source());
		if (!isInitTensorOp(writeOp.source()))
b.create<CopyOp>(loc, v, newInputBuffer);		b.create<CopyOp>(loc, v, newInputBuffer);
} else {		} else {
// InPlace write will result in memref.tensor_load(x) which must		// InPlace write will result in memref.tensor_load(x) which must
// canonicalize away with one of it uses.		// canonicalize away with one of it uses.
newInputBuffer = lookup(bvm, writeOp.source());		newInputBuffer = lookup(bvm, writeOp.source());
assert(newInputBuffer && "missing buffer");		assert(newInputBuffer && "missing buffer");
}		}

// Create a new transfer_write on buffer that doesn't have a return value.		// Create a new transfer_write on buffer that doesn't have a return value.
▲ Show 20 Lines • Show All 766 Lines • Show Last 20 Lines