Details

Reviewers

efriedma
craig.topper
spatel
foad
pengfei
lebedev.ri
RKSimon

Summary

In the same vein as D127115, this is a step toward processing the DAG in topological order.

This is a bit of a shotgun diff and will require many issues to be fixed before proceeding.
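
As a rough illustration of what seeding a worklist in topological order means, here is a minimal sketch on a toy graph. It is not the DAGCombiner's code: the dictionary representation and the `topo_worklist` helper are hypothetical, and it assumes operands are emitted before their users, which is consistent with the abds.ll inline comment later in this thread. The toy graph mirrors the node numbering of that dump.

```python
from collections import deque

# Toy DAG (hypothetical representation): each node maps to the nodes it
# uses as operands. Node names follow the abds.ll dump quoted in the thread.
DAG = {
    "t0": [],
    "t2": ["t0"],
    "t3": ["t2"],
    "t7": ["t3"],
    "t5": ["t0"],
    "t6": ["t5"],
    "t8": ["t6"],
    "t9": ["t7", "t8"],
    "t10": ["t9"],
    "t11": ["t10"],
}


def topo_worklist(operands):
    """Return the nodes ordered so that each node follows all of its operands."""
    # pending[n] = number of distinct operands of n that are not yet emitted.
    pending = {n: len(set(ops)) for n, ops in operands.items()}
    # users[n] = nodes that take n as an operand.
    users = {n: [] for n in operands}
    for n, ops in operands.items():
        for op in set(ops):
            users[op].append(n)

    # Start from nodes with no operands (Kahn's algorithm).
    ready = deque(n for n, count in pending.items() if count == 0)
    order = []
    while ready:
        n = ready.popleft()
        order.append(n)
        for user in users[n]:
            pending[user] -= 1
            if pending[user] == 0:
                ready.append(user)

    if len(order) != len(operands):
        raise ValueError("graph has a cycle, so it is not a DAG")
    return order


order = topo_worklist(DAG)
print(order)
```

With this ordering, a node such as `t7` (sign_extend) is always visited before `t10` (abs) and `t11` (truncate), so any fold that rewrites `t7` runs before folds rooted at its users get a chance to match the original pattern.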

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	590 ms	x64 debian > Clang.CodeGen/SystemZ::builtins-systemz-zvector3-constrained.c
	770 ms	x64 debian > Clang.CodeGen/SystemZ::builtins-systemz-zvector3.c
	60 ms	x64 debian > LLVM.CodeGen/AArch64::alloca.ll
	120 ms	x64 debian > LLVM.CodeGen/AArch64::arm64-abi_align.ll
	160 ms	x64 debian > LLVM.CodeGen/AArch64::arm64-bitfield-extract.ll
		View Full Test Results (277 Failed)

Event Timeline

deadalnix created this revision.Jun 14 2023, 8:52 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 14 2023, 8:52 AM

Herald added subscribers: armkevincheng, sjarus, eric-k256 and 3 others. · View Herald Transcript

deadalnix requested review of this revision.Jun 14 2023, 8:52 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 14 2023, 8:52 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

deadalnix retitled this revision from [RFX][DAG] Initially add nodes in the worklist in topological order. to [RFC][DAG] Initially add nodes in the worklist in topological order..Jun 14 2023, 8:53 AM

Harbormaster completed remote builds in B238828: Diff 531362.Jun 14 2023, 9:30 AM

n-omer added a subscriber: n-omer.Jun 14 2023, 12:39 PM

oh no, not again :) A lot of these seem to be related to changes in extension/truncations - although I haven't noticed any common pattern

In D152928#4424415, @RKSimon wrote:

oh no, not again :) A lot of these seem to be related to changes in extension/truncations - although I haven't noticed any common pattern

This one should be the most disruptive one. After that, it's mostly in topological order, and there will be a couple of patches to ensure this is always the case, but the disturbance should be minimal.

Fix numerous tests

Herald added subscribers: wangpc, luke, pmatos and 34 others. · View Herald TranscriptJun 16 2023, 6:41 AM

Harbormaster completed remote builds in B239401: Diff 532126.Jun 16 2023, 7:14 AM

This causes many interesting regressions for AMDGPU!

Herald added a subscriber: davidegrohmann. · View Herald TranscriptJun 20 2023, 3:40 AM

RKSimon added inline comments.Jun 20 2023, 4:28 AM

llvm/test/CodeGen/X86/abds.ll
Line 31:

t0: ch,glue = EntryToken
t2: i32,ch = CopyFromReg t0, Register:i32 %0
t3: i8 = truncate t2
t7: i64 = sign_extend t3
t5: i32,ch = CopyFromReg t0, Register:i32 %1
t6: i8 = truncate t5
t8: i64 = sign_extend t6
t9: i64 = sub t7, t8
t10: i64 = abs t9
t11: i8 = truncate t10

This has regressed because the sign_extend(truncate()) pairs have already been folded to sign_extend_inreg(any_extend()) before foldABSToABD is called via visitTRUNCATE, meaning we only call it later via visitABS directly and lose the smaller types.
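
To make the "smaller types" point concrete, the following brute-force check confirms that the pattern in the dump, truncate(abs(sub(sign_extend a, sign_extend b))) on i8 inputs, gives the same i8 result as a signed absolute difference computed directly as smax minus smin. This is only an arithmetic illustration, not foldABSToABD's implementation; the helper names are hypothetical and Python integers stand in for i64.

```python
def sext8(x):
    """Interpret the low 8 bits of x as a signed i8 value."""
    x &= 0xFF
    return x - 0x100 if x & 0x80 else x


def wide_pattern(a, b):
    # trunc_i8(abs(sub(sext(a), sext(b)))), computed in a wide integer type.
    return abs(sext8(a) - sext8(b)) & 0xFF


def abds_i8(a, b):
    # Signed absolute difference at i8: smax(a, b) - smin(a, b), wrapped to 8 bits.
    sa, sb = sext8(a), sext8(b)
    return (max(sa, sb) - min(sa, sb)) & 0xFF


# Exhaustively compare both forms over every pair of i8 bit patterns.
mismatches = [
    (a, b)
    for a in range(256)
    for b in range(256)
    if wide_pattern(a, b) != abds_i8(a, b)
]
print(len(mismatches))
```

The difference of two sign-extended i8 values has magnitude at most 255, so the final truncate loses nothing; this is why recognizing the pattern from the truncate lets the operation stay at i8 instead of i64.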

Another batch of tests

Herald added a subscriber: nemanjai. · View Herald TranscriptJun 25 2023, 7:18 AM

More tests, notably a lot of RISCV ones.

Harbormaster completed remote builds in B241035: Diff 534352.Jun 25 2023, 10:23 AM

As I go on, the changeset is becoming so big that Phabricator is unable to display it. Maybe this isn't the right approach, but then, what's the right approach?

I'd probably focus initially on x86 only as that will have the most test changes and the most overlap with other targets.

RKSimon mentioned this in rG63f1ca11a6fe: [X86] Generalize combineVectorTruncationWithPACKUS/combineVectorTruncationWithP….Jun 26 2023, 4:40 AM

Revert to showing X86 tests only for now.

Harbormaster completed remote builds in B241149: Diff 534517.Jun 26 2023, 6:55 AM

RKSimon added inline comments.Jun 28 2023, 6:13 AM

llvm/test/CodeGen/X86/avx2-shift.ll
Line 416: Many of the vector regressions are due to X86's combineVectorTruncation fold that is destroying truncate patterns too soon. We should be able to get rid of it with a little more work.

dmgreen mentioned this in D153972: [AArch64] Fold tree of offset loads combine.Jun 29 2023, 12:45 AM

deadalnix mentioned this in D154522: [DAG] Improve combineCarryDiamond to accept (uaddo_carry X, 0, Carry).Jul 5 2023, 9:30 AM

rebase on top of D154522

Harbormaster completed remote builds in B243254: Diff 537407.Jul 5 2023, 11:42 AM

RKSimon mentioned this in D154592: [X86] LowerTRUNCATE - improve handling during type legalization to PACKSS/PACKUS patterns.Jul 6 2023, 3:33 AM

Rebase on top of D154592

Harbormaster completed remote builds in B243725: Diff 538066.Jul 7 2023, 3:55 AM

RKSimon mentioned this in rG842a6728d950: [X86] LowerTRUNCATE - improve handling during type legalization to….Jul 11 2023, 2:40 AM

RKSimon mentioned this in D146121: [DAG] Move lshr narrowing from visitANDLike to SimplifyDemandedBits.Jul 12 2023, 2:39 AM

RKSimon mentioned this in rG8d598531b3f5: Revert rGf269877dc30777354be8a512e871aba1b1f9fd7a "[X86]….Jul 13 2023, 6:12 AM

Rebase on top of various patches by @RKSimon

Harbormaster completed remote builds in B245365: Diff 540381.Jul 14 2023, 5:54 AM

Fix tests

Harbormaster completed remote builds in B245393: Diff 540422.Jul 14 2023, 8:31 AM

Rebase and regenerate a bunch of tests

Harbormaster completed remote builds in B247447: Diff 543234.Jul 22 2023, 5:01 PM

@deadalnix Please can you rebase? A lot of the vector truncate issues should be fixed now

RKSimon mentioned this in rG3c2432690ad2: [X86] Remove combineVectorSignBitsTruncation and leave TRUNCATE ->….Aug 17 2023, 4:23 AM

RKSimon added inline comments.Aug 20 2023, 9:14 AM

llvm/test/CodeGen/X86/vselect.ll

We need to add AVX1/AVX2 prefixes back here:

; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx | FileCheck %s --check-prefixes=AVX,AVX1
; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx2 | FileCheck %s --check-prefixes=AVX,AVX2

rebase

Harbormaster completed remote builds in B257889: Diff 557805.Oct 20 2023, 3:18 AM

Given that this is not going to be resolved anytime soon and we're relatively early into the review, would you be open to moving this to a GitHub PR?

In D152928#4654624, @RKSimon wrote:

Given that this is not going to be resolved anytime soon and we're relatively early into the review, would you be open to moving this to a GitHub PR?

If this is what it takes, but I'm really not thrilled by the move, to be honest. It's way harder to keep track of changes, notifications are basically useless, and there is no useful dashboard of what one needs to pay attention to either.

RKSimon mentioned this in rGac534d2a16bb: [X86] combineArithReduction - use PACKUSWB directly for PSADBW(TRUNCATE(v8i16….Oct 25 2023, 6:57 AM

In D152928#4654641, @deadalnix wrote:

In D152928#4654624, @RKSimon wrote:

Given that this is not going to be resolved anytime soon and we're relatively early into the review, would you be open to moving this to a GitHub PR?

If this is what it takes, but I'm really not thrilled by the move, to be honest. It's way harder to keep track of changes, notifications are basically useless, and there is no useful dashboard of what one needs to pay attention to either.

Yes, it's as much fun as a tooth extraction :| But I'm worried that Phab will not get much love going forward, and we're months away from getting the x86 regressions fixed, let alone any other backend.

I've started my own WIP branch of this patch at https://github.com/RKSimon/llvm-project/tree/perf/D152928 so I can more easily work through addressing issues; I'll be rebasing and force-pushing, so don't rely on git history if you track it. But wherever you decide to keep the main patch will remain the reference.

RKSimon mentioned this in rG432649700db1: [X86] vec_insert-5.ll - ensure we build with +mmx as we reference x86_mmx types.Oct 30 2023, 5:43 AM

RKSimon mentioned this in rGde4139689519: [DAG] foldABSToABD - add support for abs(sub(sign_extend_inreg()….Nov 15 2023, 7:50 AM

GitHub <noreply@github.com> mentioned this in rG761a963dfc8f: [DAG] narrowExtractedVectorBinOp - ensure we limit late node creation to….Nov 20 2023, 2:56 AM

GitHub <noreply@github.com> mentioned this in rG7b1e4239b396: [DAG] Fold (vt trunc (extload (vt x))) -> (vt load x) (#75229).Dec 18 2023, 8:21 AM

GitHub <noreply@github.com> mentioned this in rGd460c1de3b98: [DAG] SimplifyDemandedBits - don't fold sext(x) -> aext(x) if we lose an 0/-1….Mon, Jan 15, 1:19 PM

Diff	ID	Base	Description	Created	Lint	Unit
Base			Base
Diff 1	531362	e559f27		Jun 14 2023, 8:52 AM	★	★
Diff 2	532126	f9f8517	Fix numerous tests	Jun 16 2023, 6:41 AM	★	★
Diff 3	534335	93af6bd	Another batch of tests	Jun 25 2023, 7:18 AM	★	★
Diff 4	534352	b3c8554	More tests, notably a lot of RISCV ones.	Jun 25 2023, 8:58 AM	★	★
Diff 5	534517	9feed59	Revert to showing X86 tests only for now.	Jun 26 2023, 6:18 AM	★	★
Diff 6	537407	fe5e3be	rebase on top of D154522	Jul 5 2023, 10:07 AM	★	★
Diff 7	538066	0e52849	Rebase on top of D154592	Jul 7 2023, 3:54 AM	★	★
Diff 8	540381	3bb6dd2	Rebase on top of various patches by @RKSimon	Jul 14 2023, 5:36 AM	★	★
Diff 9	540422	3bb6dd2	Fix tests	Jul 14 2023, 8:03 AM	★	★
Diff 10	543234	c3b504b	Rebase and regenerate a bunch of tests	Jul 22 2023, 4:24 PM	★	★
Diff 11	557805	7e40371	rebase	Oct 20 2023, 3:10 AM	★	★

This is an archive of the discontinued LLVM Phabricator instance.

[RFC][DAG] Initially add nodes in the worklist in topological order.
Needs Review · Public
