This is an archive of the discontinued LLVM Phabricator instance.

[MLIR][Linalg] introduce batch-reduce GEMM
Closed, Public

Authored by chelini on Sep 19 2022, 12:04 AM.

Details

Summary

The batch-reduce GEMM kernel multiplies a sequence of pairs of input tensor
blocks (which together form a batch) and reduces the partial products into a
single output tensor block.

See: https://ieeexplore.ieee.org/document/9139809 for more details.
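The semantics described above can be sketched in plain Python. This is only a reference illustration, not the code added by this revision. The function name, the nested-list layout (batch x M x K, batch x K x N, M x N), and the choice to accumulate into the existing contents of the output block are assumptions made for the example.

```python
def batch_reduce_matmul(a, b, c):
    """Accumulate sum_i (a[i] @ b[i]) into c in place and return c.

    Assumed layout (nested lists): a is batch x M x K,
    b is batch x K x N, c is M x N.
    """
    batch = len(a)
    m, n = len(c), len(c[0])
    k = len(b[0])
    for i in range(batch):          # reduction over the batch dimension
        for row in range(m):
            for col in range(n):
                acc = 0
                for kk in range(k):  # ordinary GEMM reduction over K
                    acc += a[i][row][kk] * b[i][kk][col]
                c[row][col] += acc   # partial product folded into one block
    return c


# A batch of two 2x2 block pairs reduced into a single 2x2 output block.
a = [[[1, 2], [3, 4]], [[1, 0], [0, 1]]]
b = [[[1, 0], [0, 1]], [[5, 6], [7, 8]]]
c = [[0, 0], [0, 0]]
result = batch_reduce_matmul(a, b, c)
```

Unlike a batched matmul, which yields one output block per batch entry, the batch dimension here is itself a reduction dimension, so the output has no batch dimension.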


Event Timeline

chelini created this revision. Sep 19 2022, 12:04 AM
chelini requested review of this revision. Sep 19 2022, 12:04 AM
chelini updated this revision to Diff 461150. Sep 19 2022, 12:08 AM

Improve commit msg.

chelini edited the summary of this revision. Sep 19 2022, 12:09 AM
nicolasvasilache accepted this revision. Sep 19 2022, 2:48 AM

This is great, thank you Lorenzo!

This revision is now accepted and ready to land. Sep 19 2022, 2:48 AM
This revision was automatically updated to reflect the committed changes.