HomePhabricator

[AArch64] Select saturating Neon instructions

Authored by dmgreen on Oct 31 2019, 8:22 AM.

Description

[AArch64] Select saturating Neon instructions

This adds some extra patterns to select AArch64 Neon SQADD, UQADD, SQSUB
and UQSUB from the existing target independent sadd_sat, uadd_sat,
ssub_sat and usub_sat nodes.

It does not attempt to replace the existing int_aarch64_neon_uqadd
intrinsic nodes as they are apparently used for both scalar and vector,
and need to be legal on scalar types for some of the patterns to work.
The int_aarch64_neon_uqadd on scalar would move the two integers into
floating point registers, perform a Neon uqadd and move the value back.
I don't believe this is good idea for uadd_sat to do the same as the
scalar alternative is simpler (an adds with a csinv). For signed it may
be smaller, but I'm not sure about it being better.

So this just adds some extra patterns for the existing vector
instructions, matching on the _sat nodes.

Differential Revision: https://reviews.llvm.org/D69374

Details

Committed
dmgreenOct 31 2019, 10:28 AM
Differential Revision
D69374: [AArch64] Select sadd_sat, uadd_sat, usub_sat and ssub_sat.
Parents
rG62c0746896f9: [lit] Rename ProgressDisplay -> Display
Branches
Unknown
Tags
Unknown