This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Change size suffix for FP16FML intrinsics.
ClosedPublic

Authored by ab on Feb 15 2019, 2:04 PM.

Details

Summary

These currently use _u32, but they should instead use _f32 or _f16, the types of the accumulator, and of the multiplication.

I'm starting with _f16 (because that seems to match the various integer vmlal variants), but either seems fine.

Diff Detail

Repository
rL LLVM

Event Timeline

ab created this revision.Feb 15 2019, 2:04 PM

I am discussing this with our GCC team as we would like both Clang/GCC implementation to be the same. But you're right that _f16 looks like to be the more consistent choice. I will let you know as soon I know more.

SjoerdMeijer accepted this revision.Feb 19 2019, 5:08 AM

LGTM

The ACLE has been updated and a new version with change included will be released soon.

This revision is now accepted and ready to land.Feb 19 2019, 5:08 AM
This revision was automatically updated to reflect the committed changes.
Herald added a project: Restricted Project. · View Herald TranscriptFeb 20 2019, 5:13 PM
ab added a comment.Feb 20 2019, 5:14 PM

Thanks for checking, much appreciated!