This is an archive of the discontinued LLVM Phabricator instance.


Differential D102496

[Passes] Run vector-combine early with -fenable-matrix.
Closed, Public

Authored by fhahn on May 14 2021, 6:50 AM.

Details

Reviewers

anemet
spatel
RKSimon

Commits

rGa7c6471a8538: [Passes] Run vector-combine early with -fenable-matrix.

Summary

IR with matrix intrinsics is likely to also contain large vector
operations, which can benefit from early simplifications.

This is the last step in a series of changes to improve code generation for
code that uses matrix subscript operators with the C/C++ matrix extension in
Clang, for example:

using matrix_t = double __attribute__((matrix_type(15, 15)));

void foo(unsigned i, matrix_t &A, matrix_t &B) {
  for (unsigned j = 0; j < 4; ++j)
    for (unsigned k = 0; k < i; k++)
      B[k][j] -= A[k][j] * B[i][j];
}

https://clang.godbolt.org/z/6dKxK1Ed7
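To make the connection between matrix subscripts and large vector operations concrete, here is a minimal sketch in plain C++ that does not use the matrix extension. It relies on the documented fact that Clang matrix types are laid out in column-major order without padding, so a 15x15 matrix of double behaves like one flat vector of 225 elements, and each subscript access turns into an element access on that wide vector. The names `FlatMatrix`, `flatIndex`, and `fooFlat` are hypothetical and exist only for this illustration; they are not part of Clang or LLVM.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Dimensions taken from the matrix_t typedef in the summary.
constexpr std::size_t kRows = 15;
constexpr std::size_t kCols = 15;

// Hypothetical stand-in for matrix_t: one flat vector of 225 doubles.
using FlatMatrix = std::array<double, kRows * kCols>;

// Column-major flattening: element (row, col) lives at col * kRows + row.
constexpr std::size_t flatIndex(std::size_t row, std::size_t col) {
  return col * kRows + row;
}

// Same loop nest as foo() in the summary, written against the flat layout.
// Precondition (assumed here, implicit in the original): i < kRows.
void fooFlat(unsigned i, const FlatMatrix &A, FlatMatrix &B) {
  assert(i < kRows && "row index out of range");
  for (unsigned j = 0; j < 4; ++j)
    for (unsigned k = 0; k < i; ++k)
      B[flatIndex(k, j)] -= A[flatIndex(k, j)] * B[flatIndex(i, j)];
}
```

Every iteration touches single elements of a 225-element vector. Scalarizing such accesses early, while the index computations are still simple, is the kind of opportunity the summary refers to.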

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
40 ms	x64 debian > LLVM.Other::opt-O3-pipeline-enable-matrix.ll
80 ms	x64 windows > LLVM.Other::opt-O3-pipeline-enable-matrix.ll

Event Timeline

fhahn created this revision. May 14 2021, 6:50 AM

Herald added a subscriber: hiraditya. May 14 2021, 6:50 AM

fhahn requested review of this revision. May 14 2021, 6:50 AM

Herald added a project: Restricted Project. May 14 2021, 6:50 AM

fhahn added parent revisions: D102478: [Matrix] Emit assumption that matrix indices are valid., D102476: [VectorCombine] Use constant range info for index scalarization legality. May 14 2021, 6:51 AM

Any testcase to ensure that passes together now produce what you want?

Harbormaster completed remote builds in B104492: Diff 345429. May 14 2021, 8:12 AM

Some test coverage (phaseordering?) would be useful.

Thanks for taking a look! I added a phase ordering test and updated the pipeline tests as well.

Harbormaster completed remote builds in B104705: Diff 345704. May 16 2021, 5:52 AM

LGTM - anyone else have any comments?

Over in D102002, I am looking at divergence between the regular and LTO pipelines...
If I'm seeing this correctly, we will not alter the LTO pipeline in this patch. Is that intentional?

In D102496#2772019, @spatel wrote:

Over in D102002, I am looking at divergence between the regular and LTO pipelines...
If I'm seeing this correctly, we will not alter the LTO pipeline in this patch. Is that intentional?

Yes that's intentional. The motivation to run vector-combine early here is to catch combine & scalarization opportunities before operations are moved too much by GVN, unrolling & co. At the LTO stage, those should already be covered by the pre-LTO steps, so there's no need to do another run during the LTO stage I think.
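The gating described in this reply can be sketched with a toy model. This is not LLVM's PassBuilder API and not the code from this patch: the struct, the function, and the stage names are hypothetical, and the stage list and its order are heavily abbreviated for illustration. The only point it models is that the extra early vector-combine run is added when matrix support is enabled and the pipeline being built is not the LTO-stage one.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical toy model; none of these names are LLVM APIs.
struct PipelineOptions {
  bool enableMatrix = false; // models -fenable-matrix
  bool isLTO = false;        // models building the LTO-stage pipeline
};

// Returns an abbreviated, illustrative list of stage names.
std::vector<std::string> buildToyPipeline(const PipelineOptions &opts) {
  std::vector<std::string> passes;
  passes.push_back("early-cleanup");
  // Extra early run: only pre-LTO and only when matrix support is on,
  // so it sees the code before later stages move operations around.
  if (opts.enableMatrix && !opts.isLTO)
    passes.push_back("vector-combine (early)");
  passes.push_back("gvn");
  passes.push_back("loop-unroll");
  passes.push_back("vector-combine"); // assumed regular, later run
  return passes;
}

// Small usage check of the gating logic.
void checkToyPipeline() {
  PipelineOptions opts;
  opts.enableMatrix = true;
  assert(buildToyPipeline(opts)[1] == "vector-combine (early)");
  opts.isLTO = true;
  assert(buildToyPipeline(opts)[1] == "gvn");
}
```

In this model the LTO-stage pipeline is unchanged by the flag, which matches the stated intent that the pre-LTO run already covers those opportunities.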

In D102496#2773930, @fhahn wrote:

Yes that's intentional. The motivation to run vector-combine early here is to catch combine & scalarization opportunities before operations are moved too much by GVN, unrolling & co. At the LTO stage, those should already be covered by the pre-LTO steps, so there's no need to do another run during the LTO stage I think.

Ah, I still haven't made sense of all the pipeline stages for LTO.
LGTM.

This revision is now accepted and ready to land. May 21 2021, 9:28 AM

fhahn added a parent revision: D110171: [VectorCombine] Switch to using a worklist. Sep 21 2021, 2:50 PM

Herald added a subscriber: ormris. Sep 21 2021, 2:50 PM

Rebased. I am planning on landing this after D110171 lands.

Harbormaster completed remote builds in B124991: Diff 374041. Sep 21 2021, 2:52 PM

This revision was landed with ongoing or failed builds. Sep 22 2021, 4:49 AM

Closed by commit rGa7c6471a8538: [Passes] Run vector-combine early with -fenable-matrix. (authored by fhahn).

This revision was automatically updated to reflect the committed changes.

fhahn added a commit: rGa7c6471a8538: [Passes] Run vector-combine early with -fenable-matrix.

spatel mentioned this in D138353: [Passes][VectorCombine] enable early run generally and try load folds. Nov 19 2022, 7:47 AM

spatel mentioned this in rG8f337f8ffe36: [VectorCombine] generalize pass param name for early combines; NFC. Nov 21 2022, 10:58 AM

spatel mentioned this in rG163bb6d64e5f: [Passes][VectorCombine] enable early run generally and try load folds.

Revision Contents

Path	Size
llvm/lib/Passes/PassBuilder.cpp	5 lines
llvm/lib/Transforms/IPO/PassManagerBuilder.cpp