Details

Reviewers

zvi
spatel
DavidKreitzer
RKSimon

Commits

rGc1d5955684db: [X86] Unsigned saturation subtraction canonicalization [the backend part]
rL315237: [X86] Unsigned saturation subtraction canonicalization [the backend part]

Summary

This is the backend part of the unsigned saturation canonicalization patch: https://reviews.llvm.org/D37510

The patch transforms the canonical form of unsigned saturating subtraction, which is sub(max(a,b),a) or sub(a,min(a,b)), into the special psubus instruction on targets that support it (8-bit and 16-bit unsigned integers).
umax(a,b) - b -> subus(a,b)
a - umin(a,b) -> subus(a,b)
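
The two identities above can be checked with a scalar model of a single 8-bit lane. This is only a sketch for illustration: subus8 and identities_hold are hypothetical names, not code from the patch, and the model assumes psubus behaves per lane as "subtract, clamp at zero".

```cpp
#include <algorithm>
#include <cstdint>

// Scalar model of one 8-bit lane of psubus: unsigned subtraction clamped at 0.
// (Hypothetical helper for illustration; not part of the patch.)
constexpr std::uint8_t subus8(std::uint8_t a, std::uint8_t b) {
  return a > b ? static_cast<std::uint8_t>(a - b) : std::uint8_t{0};
}

// Returns true if both canonical forms agree with subus8 for all 65536 pairs.
bool identities_hold() {
  for (unsigned a = 0; a < 256; ++a) {
    for (unsigned b = 0; b < 256; ++b) {
      const auto x = static_cast<std::uint8_t>(a);
      const auto y = static_cast<std::uint8_t>(b);
      // umax(a,b) - b
      const auto viaMax = static_cast<std::uint8_t>(std::max(x, y) - y);
      // a - umin(a,b)
      const auto viaMin = static_cast<std::uint8_t>(x - std::min(x, y));
      if (viaMax != subus8(x, y) || viaMin != subus8(x, y))
        return false;
    }
  }
  return true;
}
```

Both forms never wrap around, because the minuend is always at least as large as the subtrahend, which is why they can stand in for the saturating instruction.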

There is also an extra case handled, where the right operand of the sub is 32-bit and can be truncated using UMIN (this transformation was discussed in https://reviews.llvm.org/D25987).

An example of code for the special case:

void foo(unsigned short *p, int max, int n) {

  int i;
  unsigned m;
  for (i = 0; i < n; i++) {
    m = *--p;
    *p = (unsigned short)(m >= max ? m-max : 0);
  }
}

In this example, max is clamped to the max_short value (via UMIN) if it is greater than that, and otherwise simply truncated to 16 bits. This is a valid transformation, because if max > max_short, the result of the expression is zero.
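
The argument for the special case can be sketched in scalar form. The helper names below (subus16, original_form, narrowed_form, forms_agree_on_samples) are hypothetical and only model one 16-bit lane; the int max from the example is modelled as a 32-bit unsigned value, since the comparison against the unsigned m is done in unsigned arithmetic.

```cpp
#include <algorithm>
#include <cstdint>

// Scalar model of one 16-bit lane of psubus (hypothetical helper).
constexpr std::uint16_t subus16(std::uint16_t a, std::uint16_t b) {
  return a > b ? static_cast<std::uint16_t>(a - b) : std::uint16_t{0};
}

// Loop body of the example: compare and subtract in 32 bits, then truncate.
std::uint16_t original_form(std::uint16_t elt, std::uint32_t max) {
  const std::uint32_t m = elt;  // zero-extended load
  return static_cast<std::uint16_t>(m >= max ? m - max : 0u);
}

// Narrowed form: clamp max with umin, truncate to 16 bits, then subus.
std::uint16_t narrowed_form(std::uint16_t elt, std::uint32_t max) {
  const auto clamped =
      static_cast<std::uint16_t>(std::min<std::uint32_t>(max, 0xFFFFu));
  return subus16(elt, clamped);
}

// Compares both forms for every 16-bit element over a few sample max values,
// including values just around the 16-bit boundary.
bool forms_agree_on_samples() {
  const std::uint32_t maxes[] = {0u,       1u,       0x7FFFu,  0xFFFEu, 0xFFFFu,
                                 0x10000u, 0x10001u, 0x12345u, 0xFFFFFFFFu};
  for (std::uint32_t max : maxes)
    for (std::uint32_t e = 0; e <= 0xFFFFu; ++e)
      if (original_form(static_cast<std::uint16_t>(e), max) !=
          narrowed_form(static_cast<std::uint16_t>(e), max))
        return false;
  return true;
}
```

The clamp matters: truncating max = 0x10001 without the umin would leave 1 and produce a nonzero result for elements greater than 1, whereas the original expression yields zero.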

Here is the table of types I try to support; special-case items are bold:

Size	128	256	512
i8	v16i8	v32i8	v64i8
i16	v8i16	v16i16	v32i16
i32		v8i32	v16i32
i64			v8i64
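
For one cell of the table, v16i8, the equivalence can be sketched with SSE2 intrinsics, where _mm_subs_epu8 corresponds to psubusb. This is an illustration and not code from the patch: the function names and the HAVE_SSE2_SKETCH macro are hypothetical, and a scalar fallback is included on the assumption that the code may be built for a target without SSE2.

```cpp
#include <cstdint>
#include <cstring>
#if defined(__SSE2__) || defined(_M_X64)
#include <emmintrin.h>
#define HAVE_SSE2_SKETCH 1  // hypothetical macro, used only in this sketch
#endif

// out[i] = subus(a[i], b[i]) for 16 unsigned bytes (psubusb when available).
void subus_v16i8(const std::uint8_t *a, const std::uint8_t *b,
                 std::uint8_t *out) {
#ifdef HAVE_SSE2_SKETCH
  const __m128i va = _mm_loadu_si128(reinterpret_cast<const __m128i *>(a));
  const __m128i vb = _mm_loadu_si128(reinterpret_cast<const __m128i *>(b));
  _mm_storeu_si128(reinterpret_cast<__m128i *>(out), _mm_subs_epu8(va, vb));
#else
  for (int i = 0; i < 16; ++i)
    out[i] = a[i] > b[i] ? static_cast<std::uint8_t>(a[i] - b[i]) : 0;
#endif
}

// Canonical form umax(a,b) - b, computed without a saturating instruction.
void umax_minus_b_v16i8(const std::uint8_t *a, const std::uint8_t *b,
                        std::uint8_t *out) {
#ifdef HAVE_SSE2_SKETCH
  const __m128i va = _mm_loadu_si128(reinterpret_cast<const __m128i *>(a));
  const __m128i vb = _mm_loadu_si128(reinterpret_cast<const __m128i *>(b));
  _mm_storeu_si128(reinterpret_cast<__m128i *>(out),
                   _mm_sub_epi8(_mm_max_epu8(va, vb), vb));
#else
  for (int i = 0; i < 16; ++i) {
    const std::uint8_t mx = a[i] > b[i] ? a[i] : b[i];
    out[i] = static_cast<std::uint8_t>(mx - b[i]);
  }
#endif
}

// Runs both forms on one sample vector and compares the results bytewise.
bool v16i8_forms_match() {
  std::uint8_t a[16], b[16], r1[16], r2[16];
  for (int i = 0; i < 16; ++i) {
    a[i] = static_cast<std::uint8_t>(i * 17);        // 0, 17, ..., 255
    b[i] = static_cast<std::uint8_t>(255 - i * 13);  // 255, 242, ..., 60
  }
  subus_v16i8(a, b, r1);
  umax_minus_b_v16i8(a, b, r2);
  return std::memcmp(r1, r2, 16) == 0;
}
```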

Diff Detail

Event Timeline

yulia_koval created this revision. Sep 6 2017, 1:46 PM

zvi added reviewers: zvi, spatel, DavidKreitzer. Sep 11 2017, 6:43 AM

zvi set the repository for this revision to rL LLVM.

zvi added a subscriber: llvm-commits.

Please add some details about the combine in the Summary. It's better to put a description in addition to the links to make it easier to read the commit log.
Maybe something like:
Combine:
umax(a,b) - b -> subus(a,b)
a - umin(a,b) -> subus(a,b)

And better add an X86 label to the title.

lib/Target/X86/X86ISelLowering.cpp
35489	Would a better place for this comment be right after the '// Try to find ... ' comment?
35508	Please drop braces for if/else with a single statement
35538	Can we generalize by using DAG.computeKnownBits to tell us that the upper bits are zeroed?

spatel mentioned this in D37849: [SelectionDAG] Add BITCAST handling to ComputeNumSignBits for splatted sign bits. Sep 14 2017, 8:10 AM

Added 512bit support and fixed comments

yulia_koval retitled this revision from "Unsigned saturation subtraction canonicalization [the backend part]" to "[X86] Unsigned saturation subtraction canonicalization [the backend part]". Sep 26 2017, 10:27 PM

yulia_koval edited the summary of this revision.

Please add the avx512vl feature to the AVX512 test (in the parent patch) and rebase this patch. It will drastically improve some of the cases :).

lib/Target/X86/X86ISelLowering.cpp
35645	Please drop the braces

yulia_koval updated this revision to Diff 116797. Sep 27 2017, 5:20 AM

zvi mentioned this in rL314305: X86 Tests: Unsigned saturation subtraction tests. NFC. Sep 27 2017, 7:39 AM

RKSimon added a reviewer: RKSimon. Sep 27 2017, 9:45 AM

zvi added inline comments. Sep 28 2017, 7:06 AM

test/CodeGen/X86/psubus.ll
2383 ↗	(On Diff #116797)	Looks like in this case SSE4.1, AVX1 and AVX2 regressed. Can you please take a look at it?

Fixed problem on min case.

yulia_koval marked an inline comment as done. Oct 2 2017, 12:58 AM

LGTM. Thanks!

This revision is now accepted and ready to land. Oct 9 2017, 12:12 AM

Thanks! Could you please commit it for me?

Closed by commit rL315237: [X86] Unsigned saturation subtraction canonicalization [the backend part] (authored by zvi). Oct 9 2017, 1:03 PM

This revision was automatically updated to reflect the committed changes.

spatel added inline comments. Oct 11 2017, 10:35 AM

llvm/trunk/lib/Target/X86/X86ISelLowering.cpp
35928–35930 ↗	(On Diff #118259)	Sorry I didn't have a chance to look at this closely before commit, but why are we using isEqualTo() here? This pattern only applies to integer types, so we could use simple equality checks instead?

Thanks, fixed your comment in additional patch below.

spatel mentioned this in rL315589: [x86] replace isEqualTo with == for efficiency. Oct 12 2017, 9:16 AM

In D37534#895588, @yulia_koval wrote:

Thanks, fixed your comment in additional patch below.

Thank you - committed with rL315589.

RKSimon mentioned this in D25987: [DAG] Match USUBSAT patterns through zext/trunc. Oct 26 2017, 2:31 PM

This is an archive of the discontinued LLVM Phabricator instance.

[X86] Unsigned saturation subtraction canonicalization [the backend part]
Closed, Public

Revision Contents

Diff 118746

lib/Target/X86/X86ISelLowering.cpp
