HomePhabricator

[AArch64]: BFloat MatMul Intrinsics&CodeGen

Authored by LukeGeeson on Jun 9 2020, 11:44 AM.

Description

[AArch64]: BFloat MatMul Intrinsics&CodeGen

This patch upstreams support for BFloat Matrix Multiplication Intrinsics
and Code Generation from __bf16 to AArch64. This includes IR intrinsics. Unittests are
provided as needed. AArch32 Intrinsics + CodeGen will come after this
patch.

This patch is part of a series implementing the Bfloat16 extension of
the
Armv8.6-a architecture, as detailed here:

https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a

The bfloat type, and its properties are specified in the Arm
Architecture
Reference Manual:

https://developer.arm.com/docs/ddi0487/latest/arm-architecture-reference-manual-armv8-for-armv8-a-architecture-profile

The following people contributed to this patch:

Luke Geeson

  • Momchil Velikov
  • Mikhail Maltsev
  • Luke Cheeseman

Reviewers: SjoerdMeijer, t.p.northover, sdesmalen, labrinea, miyuki,
stuij

Reviewed By: miyuki, stuij

Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits,
llvm-commits, miyuki, chill, pbarrio, stuij

Tags: #clang, #llvm

Differential Revision: https://reviews.llvm.org/D80752

Change-Id: I174f0fd0f600d04e3799b06a7da88973c6c0703f

Details

Committed
LukeGeesonJun 16 2020, 7:23 AM
Reviewer
miyuki
Differential Revision
D80752: [AArch64]: BFloat MatMul Intrinsics&CodeGen
Parents
rG508a4764c0ed: [AArch64]: BFloat Load/Store Intrinsics&CodeGen
Branches
Unknown
Tags
Unknown