This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
lib/Transforms/Utils/
-
Transforms/
-
Utils/
-
CommutativityUtils.cpp
-
test/
-
Transforms/
5
test-commutativity-utils.mlir
-
lib/Dialect/Test/
-
Dialect/
-
Test/
-
TestOps.td

Differential D157528

[mlir] Commutativity Operand Sorting Pattern Bug Fixes
Needs RevisionPublic

Authored by amandatang on Aug 9 2023, 10:52 AM.

Download Raw Diff

Details

Reviewers

nicolasvasilache
jpienaar
okwank
srishti-pm
okkwon

Summary

This patch addresses some bugs in the commutative operand sorting logic, adds some tests and performs some cleanup.

Bug 1:
The first bug occurs when comparing key sizes when at least one ancestorQueue is empty. If one ancestorQueue is not empty, its traversal isn't complete and the size comparison may be incorrect.

In the while loop of the comparison method, before comparing the entries at the current key index, if one of the ancestor queues is empty, then the sizes of the ancestor keys are compared.

Consider operandA and operandB with the following keys and ancestorQueues:

operand A key = {
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: BLOCK_ARGUMENT, opName: ""}, 
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand A ancestorQueue = empty

operand B key = {
    {type: NON_CONSTANT_OP, opName: "arith.divsi"}
}

operand B ancestorQueue = {block_argument, block_argument}

Operand A has a fully generated key and empty ancestor queue while Operand B is not fully generated. When they are compared, operand A should be smaller because at keyIndex 0, "arith.addi" is smaller than "arith.divsi". However, Operand A's ancestor queue is empty, so the following if statement is executed:

if (commOperandA->ancestorQueue.empty() || commOperandB->ancestorQueue.empty())
  return commOperandA->key.size() < commOperandB->key.size();

The comparison incorrectly returns false since operand A's key size is 3 and operand B's key size is 1.

To fix this, the entries at the current index must be checked and compared first if possible. The ancestor key sizes must only be compared if the current index exceeds one of the sizes and both ancestor keys are fully generated.

Bug 2:
The second bug lies in the visitedAncestors set skipping previously seen Operation pointers. The visitedAncestors DenseSet that keeps track of ancestors that were already visited attempts to keep track of and prevent checks on duplicate operands. The same operation may be used as multiple different operands and should not be treated as a duplicate.

Consider the following example:

%0 = arith.addi <block argument> <block argument>
%1 = arith.subi <block argument> <block argument>
%2 = arith.divsi <block argument> <block argument>
%3 = arith.muli %0, %0
%4 = arith.muli %1, %0
%5 = arith.muli %0, %2
operand A = arith.divsi %3, %4
operand B = arith.divsi %5, %4

The expected ancestor keys of operands A and B are:

operand A expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, // duplicate appearance of %0
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},  // duplicate appearance of %0
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand B expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},  // duplicate appearance of %0
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

The actual ancestor keys of operands A and B are:

operand A actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand B actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

When comparing the expected keys, operand A is smaller than operand B because at keyIndex 4, "arith.addi" is smaller than "arith.divsi". However, when comparing the actual keys, operand B ends up being smaller than operand A because at keyIndex 4, “arith.divsi” is smaller than “arith.subi”.

To fix this, visitedAncestors and its corresponding checks are removed completely.

Bug 3:
The third bug has to do with block arguments not being added to the ancestor queue and key.

The ancestorQueue is a queue of Operation pointers. Currently, ancestors are only added to the queue if they can be successfully cast to Operation pointers, which means Block Arguments are skipped. The key is formed by taking from the front of the ancestorQueue each time, so it also does not contain Block Arguments. This is fine for cases where Block Arguments are on the lowest level of the keys (at the end of the key). However, if Block Arguments appear at other levels, it can cause incorrect comparisons.

Consider the operandA and operandB in the following example:

%0 = arith.addi <block argument> <block argument>
%1 = arith.subi <block argument> <block argument>
operandA = arith.divsi <block argument>, %1
operandB = arith.divsi %0, %1
commutativeOp = commutative %operandA, %operandB

The expected ancestor keys of operandA and operandB are:

operandA expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: NON_CONSTANT_OP, opName: "arith.subi"},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operandB expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},
  {type: NON_CONSTANT_OP, opName: "arith.subi"},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

The actual ancestor keys of operandA and operandB are:

operandA actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}
}

operandB actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},
  {type: NON_CONSTANT_OP, opName: "arith.subi"}
}

When comparing the expected keys, operandA is smaller than operandB because at keyIndex 1, the block argument is smaller than "arith.addi". However, when comparing the actual keys, operand B ends up being smaller than operand A because at keyIndex 1, “arith.addi” is smaller than “arith.subi”.

To fix this, Block Arguments are added to the ancestorQueue in the form of nullptr. The nullcheck for the Operation pointer cast is removed. The AncestorKey constructor which takes an Operation pointer already treats nullptr as Block Arguments, so nothing needs to be changed there.

Bug 4:
The fourth bug is caused by an assertion failure on the comparison function that is passed into stable_sort. stable_sort expects a strictly less than comparison function, which means it should return false for equivalent values.

Note that equivalent operands have the same key size and key values at each index. When the algorithm reaches a keyIndex that exceeds the operands' key sizes, it checks commOperandA first and separately from commOperandB. If the current keyIndex exceeds commOperandA's key size and commOperandA's ancestorQueue is empty, then the comparison function returns true, implying commOperandA is smaller regardless of commOperandB's key.

To fix this, before returning true if commOperandA's ancestorQueue is empty, commOperandB's key size and ancestorQueue is checked for equivalence.

Improvement:
The OpTraitRewritePattern which is a wrapper for the RewritePattern class can be used to perform matchAndRewrite on Ops with a specified trait.

Cleanup and Tests:
Example 2 in the comments is corrected. The key associated with operand %2 is missing a block argument entry at the end. Additionally, the final sorted operand ordering should be %4, %2, %3, %1. Both examples in the comments are added as tests.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

amandatang created this revision.Aug 9 2023, 10:52 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 9 2023, 10:52 AM

Herald added subscribers: bviyer, Moerafaat, zero9178 and 21 others. · View Herald Transcript

amandatang requested review of this revision.Aug 9 2023, 10:52 AM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptAug 9 2023, 10:52 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

amandatang edited the summary of this revision. (Show Details)Aug 9 2023, 11:00 AM

amandatang added reviewers: jpienaar, okwank.

amandatang added a reviewer: srishti-pm.

Harbormaster completed remote builds in B251430: Diff 548683.Aug 9 2023, 12:18 PM

run clang-format

Harbormaster completed remote builds in B251478: Diff 548754.Aug 9 2023, 4:52 PM

Can you make the summary precise? It's a bit long-winded and the bugs aren't clear from the summary. If you can explain a bug in one line or with a small example, it will be great. You don't need to explain the older functionality in detail.

In D157528#4575049, @srishti-pm wrote:

Can you make the summary precise? It's a bit long-winded and the bugs aren't clear from the summary. If you can explain a bug in one line or with a small example, it will be great. You don't need to explain the older functionality in detail.

Here are specific examples for each bug that do not behave as intended.

Bug 1:
The same commutative operand may be compared multiple times to reach a properly sorted state. Therefore, operand ancestor keys can be at different stages of the BFS traversal.
Consider operandA and operandB with the following keys and ancestorQueues:

operand A key = {
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: BLOCK_ARGUMENT, opName: ""}, 
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand A ancestorQueue = empty

operand B key = {
    {type: NON_CONSTANT_OP, opName: "arith.divsi"}
}

operand B ancestorQueue = {block_argument, block_argument}

if (commOperandA->ancestorQueue.empty() || commOperandB->ancestorQueue.empty())
  return commOperandA->key.size() < commOperandB->key.size();

The comparison incorrectly returns false since operand A's key size is 3 and operand B's key size is 1.

Bug 2:
The visitedAncestors DenseSet can exclude different operands with the same Operation *. Consider the following example:

%0 = arith.addi <block argument> <block argument>
%1 = arith.subi <block argument> <block argument>
%2 = arith.divsi <block argument> <block argument>
%3 = arith.muli %0, %0
%4 = arith.muli %1, %0
%5 = arith.muli %0, %2
operand A = arith.divsi %3, %4
operand B = arith.divsi %5, %4

The expected ancestor keys of operands A and B are:

operand A expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, // duplicate appearance of %0
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},  // duplicate appearance of %0
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand B expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},  // duplicate appearance of %0
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

The actual ancestor keys of operands A and B are:

operand A actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand B actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

Bug 3:
Block Arguments should be added to the ancestorQueue as nullptr so they can be correctly added to the key. Consider the operandA and operandB in the following example:

%0 = arith.addi <block argument> <block argument>
%1 = arith.subi <block argument> <block argument>
operandA = arith.divsi <block argument>, %1
operandB = arith.divsi %0, %1
commutativeOp = commutative %operandA, %operandB

The expected ancestor keys of operandA and operandB are:

operandA expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: NON_CONSTANT_OP, opName: "arith.subi"},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operandB expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},
  {type: NON_CONSTANT_OP, opName: "arith.subi"},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

The actual ancestor keys of operandA and operandB are:

operandA actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}
}

operandB actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},
  {type: NON_CONSTANT_OP, opName: "arith.subi"}
}

Update comments

Harbormaster completed remote builds in B251748: Diff 549120.Aug 10 2023, 4:08 PM

Add test

@srishti-pm Let me know if these two examples make sense

In D157528#4575205, @amandatang wrote:
In D157528#4575049, @srishti-pm wrote:

Can you make the summary precise? It's a bit long-winded and the bugs aren't clear from the summary. If you can explain a bug in one line or with a small example, it will be great. You don't need to explain the older functionality in detail.

Bug 1:
The same commutative operand may be compared multiple times to reach a properly sorted state. Therefore, operand ancestor keys can be at different stages of the BFS traversal.
Consider operandA and operandB with the following keys and ancestorQueues:
operand A key = {
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: BLOCK_ARGUMENT, opName: ""}, 
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand A ancestorQueue = empty

operand B key = {
    {type: NON_CONSTANT_OP, opName: "arith.divsi"}
}

operand B ancestorQueue = {block_argument, block_argument}
Operand A has a fully generated key and empty ancestor queue while Operand B is not fully generated. When they are compared, operand A should be smaller because at keyIndex 0, "arith.addi" is smaller than "arith.divsi". However, Operand A's ancestor queue is empty, so the following if statement is executed:
if (commOperandA->ancestorQueue.empty() || commOperandB->ancestorQueue.empty())
  return commOperandA->key.size() < commOperandB->key.size();
The comparison incorrectly returns false since operand A's key size is 3 and operand B's key size is 1.

Bug 2:
The visitedAncestors DenseSet can exclude different operands with the same Operation *. Consider the following example:
%0 = arith.addi <block argument> <block argument>
%1 = arith.subi <block argument> <block argument>
%2 = arith.divsi <block argument> <block argument>
%3 = arith.muli %0, %0
%4 = arith.muli %1, %0
%5 = arith.muli %0, %2
operand A = arith.divsi %3, %4
operand B = arith.divsi %5, %4
The expected ancestor keys of operands A and B are:
operand A expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, // duplicate appearance of %0
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},  // duplicate appearance of %0
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand B expected key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"},  // duplicate appearance of %0
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}
The actual ancestor keys of operands A and B are:
operand A actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}

operand B actual key = {
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.muli"}, 
  {type: NON_CONSTANT_OP, opName: "arith.addi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.divsi"}, 
  {type: NON_CONSTANT_OP, opName: "arith.subi"}, 
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""},
  {type: BLOCK_ARGUMENT, opName: ""}
}
When comparing the expected keys, operand A is smaller than operand B because at keyIndex 4, "arith.addi" is smaller than "arith.divsi". However, when comparing the actual keys, operand B ends up being smaller than operand A because at keyIndex 4, “arith.divsi” is smaller than “arith.subi”.

Harbormaster completed remote builds in B252478: Diff 550112.Aug 14 2023, 8:11 PM

amandatang added a reviewer: okkwon.Aug 15 2023, 10:22 AM

Use strict weak ordering in the operand comparison method

Harbormaster completed remote builds in B253108: Diff 550982.Aug 16 2023, 9:43 PM

Additional tests

Harbormaster completed remote builds in B253545: Diff 551587.Aug 18 2023, 1:47 PM

Rename test

Harbormaster completed remote builds in B253598: Diff 551665.Aug 18 2023, 5:12 PM

amandatang edited the summary of this revision. (Show Details)Aug 21 2023, 11:02 AM

amandatang edited the summary of this revision. (Show Details)Aug 21 2023, 12:10 PM

Herald added a subscriber: wangpc. · View Herald TranscriptAug 21 2023, 12:10 PM

amandatang retitled this revision from [mlir] Commutativity sorting util bug fix to [mlir] Commutativity Operand Sorting Pattern Bug Fixes.Aug 21 2023, 12:35 PM

amandatang mentioned this in D158444: [mlir] Commutative Operand Sorting Pass.Aug 21 2023, 9:52 PM

Thank you for working on this. There are a few issues here. Listing them below:-

There are 7 new tests added in this patch. 5 of them work correctly without any bug fixes from your patch. Thus, all 5 of them should be removed/modified because they do not reflect the changes done in this patch. You should only add those tests that reflect the functional changes done in this patch. Upon looking closer into them, it seemed that some of them were directly pointing to a specific bug fix (for example @check_commutative_non_lowest_level_block_argument) but those tests were already working correctly before the fix.

There are 4 bug fixes being offered by this patch. Each of them warrants a separate commit of its own for better readability and ease of review and because they are all orthogonal to each other but fix the same utility. Please create one commit (and thus one patch) for each of the 4 fixes. The first fix can be done here and thus only 3 more commits are needed to be created for the 3 other fixes. Each commit will showcase test(s) that illustrate that specific bug fix. Further, since you are also doing cleanup, feel free to perform cleanup in any subset of these 4 commits.

Regarding bug 1: No need to explain what an ancestor key entry is and how it is compared. This detail is a black box with respect to this bug and is thus an unnecessary detail to add here. Thus, remove these lines: Each ancestor key entry consists of a type (block_argument, non_constant_op and constant_op in increasing order) and an op name if it is an op (empty for block arguments). An ancestor key entry is smaller if its type is smaller or if it has a smaller op name lexicographically.

Regarding bug 2: You mention: To fix this, visitedAncestors and its corresponding checks are removed completely. I think some explanation is needed on how the utility works with them removed and whether you have done something else to potentially do what they were doing, but now, correctly. A BFS requires some way to keep track of the visited nodes of the tree.

mlir/test/Transforms/test-commutativity-utils.mlir
3–16	Adding this test is unnecessary because it was working before any bug fixes.
17–38	Likewise for this test case as well.
154–173	Likewise.
175–200	Likewise.
280–305	Likewise.

This revision now requires changes to proceed.Aug 22 2023, 2:31 PM

In D157528#4586738, @amandatang wrote:

@srishti-pm Let me know if these two examples make sense

They make sense, yes. You can add them in the revision and commit summaries.

amandatang edited the summary of this revision. (Show Details)Aug 22 2023, 2:45 PM

In D157528#4608246, @srishti-pm wrote:

Thank you for working on this. There are a few issues here. Listing them below:-

There are 7 new tests added in this patch. 5 of them work correctly without any bug fixes from your patch. Thus, all 5 of them should be removed/modified because they do not reflect the changes done in this patch. You should only add those tests that reflect the functional changes done in this patch. Upon looking closer into them, it seemed that some of them were directly pointing to a specific bug fix (for example @check_commutative_non_lowest_level_block_argument) but those tests were already working correctly before the fix.

There are 4 bug fixes being offered by this patch. Each of them warrants a separate commit of its own for better readability and ease of review and because they are all orthogonal to each other but fix the same utility. Please create one commit (and thus one patch) for each of the 4 fixes. The first fix can be done here and thus only 3 more commits are needed to be created for the 3 other fixes. Each commit will showcase test(s) that illustrate that specific bug fix. Further, since you are also doing cleanup, feel free to perform cleanup in any subset of these 4 commits.

Regarding bug 1: No need to explain what an ancestor key entry is and how it is compared. This detail is a black box with respect to this bug and is thus an unnecessary detail to add here. Thus, remove these lines: Each ancestor key entry consists of a type (block_argument, non_constant_op and constant_op in increasing order) and an op name if it is an op (empty for block arguments). An ancestor key entry is smaller if its type is smaller or if it has a smaller op name lexicographically.

Regarding bug 2: You mention: To fix this, visitedAncestors and its corresponding checks are removed completely. I think some explanation is needed on how the utility works with them removed and whether you have done something else to potentially do what they were doing, but now, correctly. A BFS requires some way to keep track of the visited nodes of the tree.

Thank you for bringing up these points.

example1_test, example2_test and check_commutative_small_similar_ancestor_tree are the only tests that worked prior to the fixes in this patch. Some of the other tests also required more than one of the bug fixes in this patch to work correctly. When coming up with test cases, I was also considering coverage and other edge cases that would be nice to have checks for in case more changes are made in the future. That's why some of them do already work correctly without changes. I can add these in another separate patch following the 4 for these bug fixes.

I'll be separating these into their own patches.

Done.

The ancestorQueue and ancestor keys alone are sufficient in correctly performing the BFS traversal.

@srishti-pm In terms of separating the bug fixes into their own patches and having test cases that illustrate each bug fix, it's very difficult to write tests for each bug fix in isolation when all 4 are present at the same time in the original code. The examples I provided in the summary each assume everything else is correct. Some of the bug fixes, when moved to their own commit, also caused existing and previously correct test cases to fail.

In D157528#4608929, @amandatang wrote:

Some of the bug fixes, when moved to their own commit, also caused existing and previously correct test cases to fail.

Why will this happen if your fix is correct? I'd suggest making sure your fix is correct and doesn't have a bug in itself.

In D157528#4608941, @srishti-pm wrote:

In D157528#4608929, @amandatang wrote:

Some of the bug fixes, when moved to their own commit, also caused existing and previously correct test cases to fail.

Why will this happen if your fix is correct? I'd suggest making sure your fix is correct and doesn't have a bug in itself.

Because fixing each bug individually means the other 3 bugs remain unaddressed in each patch. Those unaddressed bugs causes problems in the tests that are not due to the fix itself. Sorry the wording was a bit confusing in my last comment.

In D157528#4610738, @amandatang wrote:

In D157528#4608941, @srishti-pm wrote:

In D157528#4608929, @amandatang wrote:

Some of the bug fixes, when moved to their own commit, also caused existing and previously correct test cases to fail.

Why will this happen if your fix is correct? I'd suggest making sure your fix is correct and doesn't have a bug in itself.

Because fixing each bug individually means the other 3 bugs remain unaddressed in each patch. Those unaddressed bugs causes problems in the tests that are not due to the fix itself. Sorry the wording was a bit confusing in my last comment.

You can have a specific ordering of commits in your local system that works for your tests. But, I'm not sure why this issue should happen for the existing tests, based on the bugs claimed and my understanding of the existing tests. It doesn't seem like any of the existing tests is working only because of a "stack of bugs".

I would suggest you try again to come up with specific tests for each bug. A unit test shouldn't be testing multiple orthogonal bug fixes together.

The issue here is that when all these fixes are combined in a single commit, it is impossible to actually review and verify their validity (I did try). It could be possible that an older bug is being unintentionally replaced by a new bug, or that the bug fix is an unintentional hack. A reviewer can't tell at this point. We want to avoid that scenario. That's why it is important to break a commit that is doing several different tasks into multiple smaller commits. Again, the size of a commit is not only defined by the number of code lines changed but also by the number of orthogonal tasks it is doing.

Herald added a subscriber: tstellar. · View Herald TranscriptAug 23 2023, 11:44 AM

Bug 2 and bug 3 are related. When the visitedAncestor check in line 152 is removed, it causes block arguments to not be added to the ancestorQueue. Originally, block arguments are only added because nullptr is not in the visitedAncestor set, so either way, the logic is not sound despite being functional. There could also be additional block argument cast and check to make this more safe.

if (!operandDefOp || !visitedAncestors.contains(operandDefOp))

In D157528#4611710, @amandatang wrote:
Bug 2 and bug 3 are related. When the visitedAncestor check in line 152 is removed, it causes block arguments to not be added to the ancestorQueue. Originally, block arguments are only added because nullptr is not in the visitedAncestor set, so either way, the logic is not sound despite being functional. There could also be additional block argument cast and check to make this more safe.
if (!operandDefOp || !visitedAncestors.contains(operandDefOp))

I see what issue you are facing.

You fixed bug 2 by doing this: visitedAncestors and its corresponding checks are removed completely. -> This is not "fixing bug 2" but rather "replacing bug 2 with a new bug". This is why you are facing the issue of interlinking between bug 2 and bug 3. You need to find a replacement for visitedAncestors, one that allows duplicated operands to be differentiated but also avoid visiting the same ancestor from the same operand position. I believe this can be done by taking into account, the operandNumber of the operand where the BFS is happening because that is unique, whether or not operands are duplicated. Once you do this, you will no longer have the weird issue of a relationship between fixing bug 2 and bug 3. In short, fix the fix of bug 2 to resolve the problem you are facing.

In the interest of time, I would still suggest you to have bug 1 and bug 4 fixes separated and ready. I will review them while you work on bug 2.

! In D157528#4608458, @amandatang wrote:
example1_test, example2_test and check_commutative_small_similar_ancestor_tree are the only tests that worked prior to the fixes in this patch.

The 5 (out of 7) new tests that are added in this patch that were already working without the fixes are example1_test, example2_test, check_commutative_non_lowest_level_block_argument, check_commutative_small_similar_ancestor_tree, and check_commutative_equal_ancestor_traversal_different_size. Every new test you add with a proposed fix should be failing before the fix. That is the only way it demonstrates the existence of a fix.

! In D157528#4608458, @amandatang wrote:
When coming up with test cases, I was also considering coverage and other edge cases that would be nice to have checks for in case more changes are made in the future. That's why some of them do already work correctly without changes. I can add these in another separate patch following the 4 for these bug fixes.

Yes, this is orthogonal work and should be done in a completely different patch, thanks. Although I don't believe example1_test and example2_test add any new test coverage. I haven't looked at the other 3 tests in detail. Will do that once your patches are broken into multiple patches, as discussed.

In D157528#4612148, @srishti-pm wrote:

! In D157528#4608458, @amandatang wrote:
example1_test, example2_test and check_commutative_small_similar_ancestor_tree are the only tests that worked prior to the fixes in this patch.

The 5 (out of 7) new tests that are added in this patch that were already working without the fixes are example1_test, example2_test, check_commutative_non_lowest_level_block_argument, check_commutative_small_similar_ancestor_tree, and check_commutative_equal_ancestor_traversal_different_size. Every new test you add with a proposed fix should be failing before the fix. That is the only way it demonstrates the existence of a fix.

! In D157528#4608458, @amandatang wrote:
When coming up with test cases, I was also considering coverage and other edge cases that would be nice to have checks for in case more changes are made in the future. That's why some of them do already work correctly without changes. I can add these in another separate patch following the 4 for these bug fixes.

Yes, this is orthogonal work and should be done in a completely different patch, thanks. Although I don't believe example1_test and example2_test add any new test coverage. I haven't looked at the other 3 tests in detail. Will do that once your patches are broken into multiple patches, as discussed.

For Bug 4, the assertion error initially was only encountered on the Windows version of the build in this patch. My local Linux environment and the Linux build were both unable to catch this. Although there is no previously broken test case that gets resolved via this fix, it is a big problem that will cause failures at runtime. The assertion exists within stable_sort to prevent the endless swapping of equal operands if the comparison method returns true for those cases.

In D157528#4612509, @amandatang wrote:

For Bug 4, the assertion error initially was only encountered on the Windows version of the build in this patch. My local Linux environment and the Linux build were both unable to catch this. Although there is no previously broken test case that gets resolved via this fix, it is a big problem that will cause failures at runtime. The assertion exists within stable_sort to prevent the endless swapping of equal operands if the comparison method returns true for those cases.

I understand that it was only encountered in Windows. It is good to work towards fixing such errors but don't combine all fixes together. That's my only ask.

Is this the only other new patch created: https://reviews.llvm.org/D158796 ? It seems to be moving around a lot of code from one file to the other. I'm confused as to whether or not such a movement is relevant to the fix claimed.

Revision Contents

Path

Size

mlir/

lib/

Transforms/

Utils/

CommutativityUtils.cpp

65 lines

test/

Transforms/

test-commutativity-utils.mlir

189 lines

lib/

Dialect/

Test/

TestOps.td

5 lines

Diff 551665

mlir/lib/Transforms/Utils/CommutativityUtils.cpp

Show First 20 Lines • Show All 73 Lines • ▼ Show 20 Lines
struct CommutativeOperand {		struct CommutativeOperand {
/// Stores the operand.		/// Stores the operand.
Value operand;		Value operand;

/// Stores the queue of ancestors of the operand's BFS traversal at a		/// Stores the queue of ancestors of the operand's BFS traversal at a
/// particular point in time.		/// particular point in time.
std::queue<Operation *> ancestorQueue;		std::queue<Operation *> ancestorQueue;

/// Stores the list of ancestors that have been visited by the BFS traversal
/// at a particular point in time.
DenseSet<Operation *> visitedAncestors;

/// Stores the operand's "key". This "key" is defined as a list of the		/// Stores the operand's "key". This "key" is defined as a list of the
/// "AncestorKeys" associated with the ancestors of this operand, in a		/// "AncestorKeys" associated with the ancestors of this operand, in a
/// breadth-first order.		/// breadth-first order.
///		///
/// So, if an operand, say `A`, was produced as follows:		/// So, if an operand, say `A`, was produced as follows:
///		///
/// `<block argument>` `<block argument>`		/// `<block argument>` `<block argument>`
/// \ /		/// \ /
Show All 14 Lines	struct CommutativeOperand {
/// {type: `NON_CONSTANT_OP`, opName: "arith.subi"},		/// {type: `NON_CONSTANT_OP`, opName: "arith.subi"},
/// {type: `CONSTANT_OP`, opName: "arith.constant"},		/// {type: `CONSTANT_OP`, opName: "arith.constant"},
/// {type: `BLOCK_ARGUMENT`, opName: ""},		/// {type: `BLOCK_ARGUMENT`, opName: ""},
/// {type: `BLOCK_ARGUMENT`, opName: ""}		/// {type: `BLOCK_ARGUMENT`, opName: ""}
/// }		/// }
SmallVector<AncestorKey, 4> key;		SmallVector<AncestorKey, 4> key;

/// Push an ancestor into the operand's BFS information structure. This		/// Push an ancestor into the operand's BFS information structure. This
/// entails it being pushed into the queue (always) and inserted into the		/// entails it being pushed into the queue
/// "visited ancestors" list (iff it is an op rather than a block argument).		void pushAncestor(Operation *op) { ancestorQueue.push(op); }
void pushAncestor(Operation *op) {
ancestorQueue.push(op);
if (op)
visitedAncestors.insert(op);
}

/// Refresh the key.		/// Refresh the key.
///		///
/// Refreshing a key entails making it up-to-date with the operand's BFS		/// Refreshing a key entails making it up-to-date with the operand's BFS
/// traversal that has happened till that point in time, i.e, appending the		/// traversal that has happened till that point in time, i.e, appending the
/// existing key with the front ancestor's "AncestorKey". Note that a key		/// existing key with the front ancestor's "AncestorKey". Note that a key
/// directly reflects the BFS and thus needs to be refreshed during the		/// directly reflects the BFS and thus needs to be refreshed during the
/// progression of the traversal.		/// progression of the traversal.
void refreshKey() {		void refreshKey() {
if (ancestorQueue.empty())		if (ancestorQueue.empty())
return;		return;

Operation *frontAncestor = ancestorQueue.front();		Operation *frontAncestor = ancestorQueue.front();
AncestorKey frontAncestorKey(frontAncestor);		AncestorKey frontAncestorKey(frontAncestor);
key.push_back(frontAncestorKey);		key.push_back(frontAncestorKey);
}		}

/// Pop the front ancestor, if any, from the queue and then push its adjacent		/// Pop the front ancestor, if any, from the queue and then push its adjacent
/// unvisited ancestors, if any, to the queue (this is the main body of the		/// ancestors, if any, to the queue (this is the main body of the
/// BFS algorithm).		/// BFS algorithm).
void popFrontAndPushAdjacentUnvisitedAncestors() {		void popFrontAndPushAdjacentAncestors() {
if (ancestorQueue.empty())		if (ancestorQueue.empty())
return;		return;
Operation *frontAncestor = ancestorQueue.front();		Operation *frontAncestor = ancestorQueue.front();
ancestorQueue.pop();		ancestorQueue.pop();
if (!frontAncestor)		if (!frontAncestor)
return;		return;
for (Value operand : frontAncestor->getOperands()) {		for (Value operand : frontAncestor->getOperands()) {
Operation *operandDefOp = operand.getDefiningOp();		Operation *operandDefOp = operand.getDefiningOp();
if (!operandDefOp \|\| !visitedAncestors.contains(operandDefOp))
pushAncestor(operandDefOp);		pushAncestor(operandDefOp);
}		}
}		}
};		};

/// Sorts the operands of `op` in ascending order of the "key" associated with		/// Sorts the operands of `op` in ascending order of the "key" associated with
/// each operand iff `op` is commutative. This is a stable sort.		/// each operand iff `op` is commutative. This is a stable sort.
///		///
/// After the application of this pattern, since the commutative operands now		/// After the application of this pattern, since the commutative operands now
Show All 39 Lines
/// Here,		/// Here,
/// 1. The key associated with %1 is:		/// 1. The key associated with %1 is:
/// `{		/// `{
/// {CONSTANT_OP, "foo.const"}		/// {CONSTANT_OP, "foo.const"}
/// }`		/// }`
/// 2. The key associated with %2 is:		/// 2. The key associated with %2 is:
/// `{		/// `{
/// {NON_CONSTANT_OP, "foo.mul"},		/// {NON_CONSTANT_OP, "foo.mul"},
		/// {BLOCK_ARGUMENT, ""},
/// {BLOCK_ARGUMENT, ""}		/// {BLOCK_ARGUMENT, ""}
/// }`		/// }`
/// 3. The key associated with %3 is:		/// 3. The key associated with %3 is:
/// `{		/// `{
/// {NON_CONSTANT_OP, "foo.mul"},		/// {NON_CONSTANT_OP, "foo.mul"},
/// {NON_CONSTANT_OP, "foo.mul"},		/// {NON_CONSTANT_OP, "foo.mul"},
/// {CONSTANT_OP, "foo.const"},		/// {CONSTANT_OP, "foo.const"},
/// {BLOCK_ARGUMENT, ""},		/// {BLOCK_ARGUMENT, ""},
/// {BLOCK_ARGUMENT, ""}		/// {BLOCK_ARGUMENT, ""}
/// }`		/// }`
/// 4. The key associated with %4 is:		/// 4. The key associated with %4 is:
/// `{		/// `{
/// {NON_CONSTANT_OP, "foo.add"},		/// {NON_CONSTANT_OP, "foo.add"},
/// {NON_CONSTANT_OP, "foo.mul"},		/// {NON_CONSTANT_OP, "foo.mul"},
/// {CONSTANT_OP, "foo.const"},		/// {CONSTANT_OP, "foo.const"},
/// {BLOCK_ARGUMENT, ""},		/// {BLOCK_ARGUMENT, ""},
/// {BLOCK_ARGUMENT, ""}		/// {BLOCK_ARGUMENT, ""}
/// }`		/// }`
///		///
/// Thus, the sorted `foo.commutative` is:		/// Thus, the sorted `foo.commutative` is:
/// %5 = foo.commutative %4, %3, %2, %1		/// %5 = foo.commutative %4, %2, %3, %1
class SortCommutativeOperands : public RewritePattern {		struct SortCommutativeOperands final
public:		: public OpTraitRewritePattern<OpTrait::IsCommutative> {
SortCommutativeOperands(MLIRContext *context)		SortCommutativeOperands(MLIRContext *context)
: RewritePattern(MatchAnyOpTypeTag(), /benefit=/5, context) {}		: OpTraitRewritePattern<OpTrait::IsCommutative>(context, /benefit=/5) {}
LogicalResult matchAndRewrite(Operation *op,		LogicalResult matchAndRewrite(Operation *op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Custom comparator for two commutative operands, which returns true iff		// Custom comparator for two commutative operands, which returns true iff
// the "key" of `constCommOperandA` < the "key" of `constCommOperandB`,		// the "key" of `constCommOperandA` < the "key" of `constCommOperandB`,
// i.e.,		// i.e.,
// 1. In the first unequal pair of corresponding AncestorKeys, the		// 1. In the first unequal pair of corresponding AncestorKeys, the
// AncestorKey in `constCommOperandA` is smaller, or,		// AncestorKey in `constCommOperandA` is smaller, or,
// 2. Both the AncestorKeys in every pair are the same and the size of		// 2. Both the AncestorKeys in every pair are the same and the size of
Show All 11 Lines	auto commutativeOperandComparator =
const_cast<std::unique_ptr<CommutativeOperand> &>(		const_cast<std::unique_ptr<CommutativeOperand> &>(
constCommOperandB);		constCommOperandB);

// Iteratively perform the BFS's of both operands until an order among		// Iteratively perform the BFS's of both operands until an order among
// them can be determined.		// them can be determined.
unsigned keyIndex = 0;		unsigned keyIndex = 0;
while (true) {		while (true) {
if (commOperandA->key.size() <= keyIndex) {		if (commOperandA->key.size() <= keyIndex) {
		// Comparator must return false for equal elements
		// B is only larger if its key size is larger than the current
		// index or its ancestor queue is not empty
if (commOperandA->ancestorQueue.empty())		if (commOperandA->ancestorQueue.empty())
return true;		return commOperandB->key.size() > keyIndex \|\|
commOperandA->popFrontAndPushAdjacentUnvisitedAncestors();		!commOperandB->ancestorQueue.empty();
		commOperandA->popFrontAndPushAdjacentAncestors();
commOperandA->refreshKey();		commOperandA->refreshKey();
}		}
if (commOperandB->key.size() <= keyIndex) {		if (commOperandB->key.size() <= keyIndex) {
if (commOperandB->ancestorQueue.empty())		if (commOperandB->ancestorQueue.empty())
return false;		return false;
commOperandB->popFrontAndPushAdjacentUnvisitedAncestors();		commOperandB->popFrontAndPushAdjacentAncestors();
commOperandB->refreshKey();		commOperandB->refreshKey();
}		}
if (commOperandA->ancestorQueue.empty() \|\|		// Try comparing the keys at the current keyIndex
commOperandB->ancestorQueue.empty())		if (keyIndex < commOperandA->key.size() &&
return commOperandA->key.size() < commOperandB->key.size();		keyIndex < commOperandB->key.size()) {
if (commOperandA->key[keyIndex] < commOperandB->key[keyIndex])		if (commOperandA->key[keyIndex] < commOperandB->key[keyIndex])
return true;		return true;
if (commOperandB->key[keyIndex] < commOperandA->key[keyIndex])		if (commOperandB->key[keyIndex] < commOperandA->key[keyIndex])
return false;		return false;
		} else { // keyIndex exceeds one or both key sizes
		// Compare key sizes if the values at every possible keyIndex were
		// equal Both operands must have fully generated key and cannot
		// have anything in the ancestorQueue
		if (commOperandA->ancestorQueue.empty() &&
		commOperandB->ancestorQueue.empty())
		return commOperandA->key.size() < commOperandB->key.size();
		}
keyIndex++;		keyIndex++;
}		}
};		};

// If `op` is not commutative, do nothing.
if (!op->hasTrait<OpTrait::IsCommutative>())
return failure();

// Populate the list of commutative operands.		// Populate the list of commutative operands.
SmallVector<Value, 2> operands = op->getOperands();		SmallVector<Value, 2> operands = op->getOperands();
SmallVector<std::unique_ptr<CommutativeOperand>, 2> commOperands;		SmallVector<std::unique_ptr<CommutativeOperand>, 2> commOperands;
for (Value operand : operands) {		for (Value operand : operands) {
std::unique_ptr<CommutativeOperand> commOperand =		std::unique_ptr<CommutativeOperand> commOperand =
std::make_unique<CommutativeOperand>();		std::make_unique<CommutativeOperand>();
commOperand->operand = operand;		commOperand->operand = operand;
commOperand->pushAncestor(operand.getDefiningOp());		commOperand->pushAncestor(operand.getDefiningOp());
Show All 20 Lines

mlir/test/Transforms/test-commutativity-utils.mlir

// RUN: mlir-opt %s -test-commutativity-utils | FileCheck %s

// CHECK-LABEL: @example1_test

func.func @example1_test(%arg0 : i32, %arg1 : i32) -> i32 {

// CHECK-NEXT: %[[ARITH_CONST:.*]] = arith.constant

%0 = arith.constant 2 : i32

// CHECK-NEXT: %[[ARITH_MULI:.*]] = arith.muli

%1 = arith.muli %arg1, %arg0 : i32

// CHECK-NEXT: %[[RESULT:.*]] = "test.op_commutative2"(%[[ARITH_MULI]], %[[ARITH_CONST]])

%result = "test.op_commutative2"(%0, %1): (i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT]]

return %result : i32

}

srishti-pmUnsubmitted

Not Done

// RUN: mlir-opt %s -test-commutativity-utils | FileCheck %s

- // CHECK-LABEL: @example1_test

- func.func @example1_test(%arg0 : i32, %arg1 : i32) -> i32 {

- // CHECK-NEXT: %[[ARITH_CONST:.*]] = arith.constant

- %0 = arith.constant 2 : i32

- // CHECK-NEXT: %[[ARITH_MULI:.*]] = arith.muli

- %1 = arith.muli %arg1, %arg0 : i32

- // CHECK-NEXT: %[[RESULT:.*]] = "test.op_commutative2"(%[[ARITH_MULI]], %[[ARITH_CONST]])

- %result = "test.op_commutative2"(%0, %1): (i32, i32) -> i32

- // CHECK-NEXT: return %[[RESULT]]

- return %result : i32

- }

// CHECK-LABEL: @example2_test

Adding this test is unnecessary because it was working before any bug fixes.

srishti-pm: Adding this test is unnecessary because it was working before any bug fixes.

// CHECK-LABEL: @example2_test

func.func @example2_test(%arg0 : i32, %arg1 : i32) -> i32 {

// CHECK-NEXT: %[[ARITH_CONST:.*]] = arith.constant

%0 = arith.constant 2 : i32

// CHECK-NEXT: %[[ARITH_MULI1:.*]] = arith.muli

%1 = arith.muli %arg1, %arg0 : i32

// CHECK-NEXT: %[[ARITH_MULI2:.*]] = arith.muli

%2 = arith.muli %1, %0 : i32

// CHECK-NEXT: %[[ARITH_ADDI:.*]] = arith.addi

%3 = arith.addi %1, %0 : i32

// CHECK-NEXT: %[[RESULT:.*]] = "test.op_commutative"(%[[ARITH_ADDI]], %[[ARITH_MULI1]], %[[ARITH_MULI2]], %[[ARITH_CONST]])

%result = "test.op_commutative"(%0, %1, %2, %3): (i32, i32, i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT]]

return %result : i32

}

srishti-pmUnsubmitted

Not Done

return %result : i32

}

- // CHECK-LABEL: @example2_test

- func.func @example2_test(%arg0 : i32, %arg1 : i32) -> i32 {

- // CHECK-NEXT: %[[ARITH_CONST:.*]] = arith.constant

- %0 = arith.constant 2 : i32

- // CHECK-NEXT: %[[ARITH_MULI1:.*]] = arith.muli

- %1 = arith.muli %arg1, %arg0 : i32

- // CHECK-NEXT: %[[ARITH_MULI2:.*]] = arith.muli

- %2 = arith.muli %1, %0 : i32

- // CHECK-NEXT: %[[ARITH_ADDI:.*]] = arith.addi

- %3 = arith.addi %1, %0 : i32

- // CHECK-NEXT: %[[RESULT:.*]] = "test.op_commutative"(%[[ARITH_ADDI]], %[[ARITH_MULI1]], %[[ARITH_MULI2]], %[[ARITH_CONST]])

- %result = "test.op_commutative"(%0, %1, %2, %3): (i32, i32, i32, i32) -> i32

- // CHECK-NEXT: return %[[RESULT]]

- return %result : i32

- }

// CHECK-LABEL: @test_small_pattern_1

Likewise for this test case as well.

srishti-pm: Likewise for this test case as well.

// CHECK-LABEL: @test_small_pattern_1

func.func @test_small_pattern_1(%arg0 : i32) -> i32 {

// CHECK-NEXT: %[[ARITH_CONST:.*]] = arith.constant

%0 = arith.constant 45 : i32

// CHECK-NEXT: %[[TEST_ADD:.*]] = "test.addi"

%1 = "test.addi"(%arg0, %arg0): (i32, i32) -> i32

▲ Show 20 Lines • Show All 98 Lines • ▼ Show 20 Lines

func.func @test_large_pattern(%arg0 : i32, %arg1 : i32) -> i32 {

%21 = arith.divsi %17, %20 : i32

// CHECK-NEXT: %[[RESULT:.*]] = "test.op_large_commutative"(%[[VAL16]], %[[VAL19]], %[[VAL19]], %[[VAL21]], %[[VAL6]], %[[VAL11]], %[[VAL15]])

%result = "test.op_large_commutative"(%16, %6, %11, %15, %19, %21, %19): (i32, i32, i32, i32, i32, i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT]]

return %result : i32

}

// CHECK-LABEL: @check_commutative_non_lowest_level_block_argument

func.func @check_commutative_non_lowest_level_block_argument(%arg0 : i32, %arg1 : i32) -> i32 {

// CHECK-NEXT: arith.addi

%0 = arith.addi %arg0, %arg1 : i32

// CHECK-NEXT: arith.subi

%1 = arith.subi %arg0, %arg1 : i32

// CHECK-NEXT: %[[VAL1:.*]] = arith.divsi

%2 = arith.divsi %arg0, %1 : i32

// CHECK-NEXT: %[[VAL2:.*]] = arith.divsi

%3 = arith.divsi %0, %1 : i32

// CHECK-NEXT: %[[RESULT:.*]] = "test.op_commutative2"(%[[VAL1]], %[[VAL2]])

%result = "test.op_commutative2"(%3, %2): (i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT]]

return %result : i32

}

srishti-pmUnsubmitted

Not Done

return %result : i32

}

- // CHECK-LABEL: @check_commutative_non_lowest_level_block_argument

- func.func @check_commutative_non_lowest_level_block_argument(%arg0 : i32, %arg1 : i32) -> i32 {

- // CHECK-NEXT: arith.addi

- %0 = arith.addi %arg0, %arg1 : i32

- // CHECK-NEXT: arith.subi

- %1 = arith.subi %arg0, %arg1 : i32

- // CHECK-NEXT: %[[VAL1:.*]] = arith.divsi

- %2 = arith.divsi %arg0, %1 : i32

- // CHECK-NEXT: %[[VAL2:.*]] = arith.divsi

- %3 = arith.divsi %0, %1 : i32

- // CHECK-NEXT: %[[RESULT:.*]] = "test.op_commutative2"(%[[VAL1]], %[[VAL2]])

- %result = "test.op_commutative2"(%3, %2): (i32, i32) -> i32

- // CHECK-NEXT: return %[[RESULT]]

- return %result : i32

- }

// CHECK-LABEL: @check_commutative_small_similar_ancestor_tree

Likewise.

srishti-pm: Likewise.

// CHECK-LABEL: @check_commutative_small_similar_ancestor_tree

func.func @check_commutative_small_similar_ancestor_tree(%arg0 : i32, %arg1 : i32) -> (i32, i32) {

// CHECK-NEXT: arith.addi

%0 = arith.addi %arg0, %arg0 : i32

// CHECK-NEXT: arith.subi

%1 = arith.subi %arg0, %arg1 : i32

// CHECK-NEXT: %[[VAL1:.*]] = arith.divsi

%2 = arith.divsi %0, %0 : i32

// CHECK-NEXT: %[[VAL2:.*]] = arith.divsi

%3 = arith.divsi %1, %1 : i32

// CHECK-NEXT: %[[VAL3:.*]] = arith.divsi

%4 = arith.divsi %0, %1 : i32

// CHECK-NEXT: %[[RESULT1:.*]] = "test.op_commutative3"(%[[VAL1]], %[[VAL3]], %[[VAL2]])

%result1 = "test.op_commutative3"(%2, %3, %4): (i32, i32, i32) -> i32

// CHECK-NEXT: %[[RESULT2:.*]] = "test.op_commutative3"(%[[VAL1]], %[[VAL3]], %[[VAL2]])

%result2 = "test.op_commutative3"(%4, %2, %3): (i32, i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT1]], %[[RESULT2]]

return %result1, %result2 : i32, i32

}

srishti-pmUnsubmitted

Not Done

return %result : i32

}

- // CHECK-LABEL: @check_commutative_small_similar_ancestor_tree

- func.func @check_commutative_small_similar_ancestor_tree(%arg0 : i32, %arg1 : i32) -> (i32, i32) {

- // CHECK-NEXT: arith.addi

- %0 = arith.addi %arg0, %arg0 : i32

- // CHECK-NEXT: arith.subi

- %1 = arith.subi %arg0, %arg1 : i32

- // CHECK-NEXT: %[[VAL1:.*]] = arith.divsi

- %2 = arith.divsi %0, %0 : i32

- // CHECK-NEXT: %[[VAL2:.*]] = arith.divsi

- %3 = arith.divsi %1, %1 : i32

- // CHECK-NEXT: %[[VAL3:.*]] = arith.divsi

- %4 = arith.divsi %0, %1 : i32

- // CHECK-NEXT: %[[RESULT1:.*]] = "test.op_commutative3"(%[[VAL1]], %[[VAL3]], %[[VAL2]])

- %result1 = "test.op_commutative3"(%2, %3, %4): (i32, i32, i32) -> i32

- // CHECK-NEXT: %[[RESULT2:.*]] = "test.op_commutative3"(%[[VAL1]], %[[VAL3]], %[[VAL2]])

- %result2 = "test.op_commutative3"(%4, %2, %3): (i32, i32, i32) -> i32

- // CHECK-NEXT: return %[[RESULT1]], %[[RESULT2]]

- return %result1, %result2 : i32, i32

- }

// CHECK-LABEL: @check_commutative_large_similar_ancestor_tree

Likewise.

srishti-pm: Likewise.

// CHECK-LABEL: @check_commutative_large_similar_ancestor_tree

func.func @check_commutative_large_similar_ancestor_tree(%arg0 : i32, %arg1 : i32) -> (i32, i32) {

// CHECK-NEXT: arith.addi

%0 = arith.addi %arg0, %arg0 : i32

// CHECK-NEXT: arith.subi

%1 = arith.subi %arg0, %arg1 : i32

// CHECK-NEXT: arith.muli

%2 = arith.muli %0, %0 : i32

// CHECK-NEXT: arith.muli

%3 = arith.muli %1, %1 : i32

// CHECK-NEXT: arith.muli

%4 = arith.muli %0, %1 : i32

// CHECK-NEXT: arith.divsi

%5 = arith.divsi %2, %3 : i32

// CHECK-NEXT: arith.divsi

%6 = arith.divsi %3, %4 : i32

// CHECK-NEXT: arith.divsi

%7 = arith.divsi %2, %4 : i32

// CHECK-NEXT: %[[VAL1:.*]] = arith.subi

%8 = arith.subi %5, %6 : i32

// CHECK-NEXT: %[[VAL2:.*]] = arith.subi

%9 = arith.subi %5, %7 : i32

// CHECK-NEXT: %[[VAL3:.*]] = arith.subi

%10 = arith.subi %6, %7 : i32

// CHECK-NEXT: %[[RESULT1:.*]] = "test.op_commutative3"(%[[VAL2]], %[[VAL1]], %[[VAL3]])

%result1 = "test.op_commutative3"(%8, %9, %10): (i32, i32, i32) -> i32

// CHECK-NEXT: %[[RESULT2:.*]] = "test.op_commutative3"(%[[VAL2]], %[[VAL1]], %[[VAL3]])

%result2 = "test.op_commutative3"(%10, %8, %9): (i32, i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT1]], %[[RESULT2]]

return %result1, %result2 : i32, i32

}

// CHECK-LABEL: @check_commutative_duplicate_ancestor

func.func @check_commutative_duplicate_ancestor(%arg0 : i32, %arg1 : i32) -> i32 {

// CHECK-NEXT: arith.addi

%0 = arith.addi %arg0, %arg1 : i32

// CHECK-NEXT: arith.subi

%1 = arith.subi %arg0, %arg1 : i32

// CHECK-NEXT: arith.divsi

%2 = arith.divsi %arg0, %arg1 : i32

// CHECK-NEXT: arith.muli

%3 = arith.muli %0, %0 : i32

// CHECK-NEXT: arith.muli

%4 = arith.muli %1, %0 : i32

// CHECK-NEXT: arith.muli

%5 = arith.muli %0, %2 : i32

// CHECK-NEXT: %[[VAL1:.*]] = arith.divsi

%6 = arith.divsi %3, %4 : i32

// CHECK-NEXT: %[[VAL2:.*]] = arith.divsi

%7 = arith.divsi %5, %4 : i32

// CHECK-NEXT: %[[RESULT:.*]] = "test.op_commutative2"(%[[VAL1]], %[[VAL2]])

%result = "test.op_commutative2"(%7, %6): (i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT]]

return %result : i32

}

// CHECK-LABEL: @check_commutative_equal_ancestor_traversal_different_size

func.func @check_commutative_equal_ancestor_traversal_different_size(%arg0 : i32, %arg1 : i32) -> (i32, i32) {

// CHECK-NEXT: "test.one_variadic_out_one_variadic_in1"

%0 = "test.one_variadic_out_one_variadic_in1"(%arg0, %arg1, %arg0, %arg1, %arg0) : (i32, i32, i32, i32, i32) -> i32

// CHECK-NEXT: "test.one_variadic_out_one_variadic_in1"

%1 = "test.one_variadic_out_one_variadic_in1"(%0) : (i32) -> i32

// CHECK-NEXT: %[[VAL1:.*]] = "test.one_variadic_out_one_variadic_in1"

%2 = "test.one_variadic_out_one_variadic_in1"(%1) : (i32) -> i32

// CHECK-NEXT: "test.one_variadic_out_one_variadic_in1"

%3 = "test.one_variadic_out_one_variadic_in1"(%arg0, %arg1) : (i32, i32) -> i32

// CHECK-NEXT: %[[VAL2:.*]] = "test.one_variadic_out_one_variadic_in1"

%4 = "test.one_variadic_out_one_variadic_in1"(%3, %3) : (i32, i32) -> i32

// CHECK-NEXT: %[[RESULT1:.*]] = "test.op_commutative2"(%[[VAL2]], %[[VAL1]])

%result1 = "test.op_commutative2"(%2, %4): (i32, i32) -> i32

// CHECK-NEXT: %[[RESULT2:.*]] = "test.op_commutative2"(%[[VAL2]], %[[VAL1]])

%result2 = "test.op_commutative2"(%4, %2): (i32, i32) -> i32

// CHECK-NEXT: return %[[RESULT1]], %[[RESULT2]]

return %result1, %result2 : i32, i32

}

srishti-pmUnsubmitted

Not Done

return %result : i32

}

- // CHECK-LABEL: @check_commutative_equal_ancestor_traversal_different_size

- func.func @check_commutative_equal_ancestor_traversal_different_size(%arg0 : i32, %arg1 : i32) -> (i32, i32) {

- // CHECK-NEXT: "test.one_variadic_out_one_variadic_in1"

- %0 = "test.one_variadic_out_one_variadic_in1"(%arg0, %arg1, %arg0, %arg1, %arg0) : (i32, i32, i32, i32, i32) -> i32

- // CHECK-NEXT: "test.one_variadic_out_one_variadic_in1"

- %1 = "test.one_variadic_out_one_variadic_in1"(%0) : (i32) -> i32

- // CHECK-NEXT: %[[VAL1:.*]] = "test.one_variadic_out_one_variadic_in1"

- %2 = "test.one_variadic_out_one_variadic_in1"(%1) : (i32) -> i32

- // CHECK-NEXT: "test.one_variadic_out_one_variadic_in1"

- %3 = "test.one_variadic_out_one_variadic_in1"(%arg0, %arg1) : (i32, i32) -> i32

- // CHECK-NEXT: %[[VAL2:.*]] = "test.one_variadic_out_one_variadic_in1"

- %4 = "test.one_variadic_out_one_variadic_in1"(%3, %3) : (i32, i32) -> i32

- // CHECK-NEXT: %[[RESULT1:.*]] = "test.op_commutative2"(%[[VAL2]], %[[VAL1]])

- %result1 = "test.op_commutative2"(%2, %4): (i32, i32) -> i32

- // CHECK-NEXT: %[[RESULT2:.*]] = "test.op_commutative2"(%[[VAL2]], %[[VAL1]])

- %result2 = "test.op_commutative2"(%4, %2): (i32, i32) -> i32

- // CHECK-NEXT: return %[[RESULT1]], %[[RESULT2]]

- return %result1, %result2 : i32, i32

- }

Likewise.

srishti-pm: Likewise.

mlir/test/lib/Dialect/Test/TestOps.td

Show First 20 Lines • Show All 1,272 Lines • ▼ Show 20 Lines	def TestLargeCommutativeOp : TEST_Op<"op_large_commutative", [Commutative]> {
let results = (outs I32);		let results = (outs I32);
}		}

def TestCommutative2Op : TEST_Op<"op_commutative2", [Commutative]> {		def TestCommutative2Op : TEST_Op<"op_commutative2", [Commutative]> {
let arguments = (ins I32:$op1, I32:$op2);		let arguments = (ins I32:$op1, I32:$op2);
let results = (outs I32);		let results = (outs I32);
}		}

		def TestCommutative3Op : TEST_Op<"op_commutative3", [Commutative]> {
		let arguments = (ins I32:$op1, I32:$op2, I32:$op3);
		let results = (outs I32);
		}

def TestIdempotentTraitOp		def TestIdempotentTraitOp
: TEST_Op<"op_idempotent_trait",		: TEST_Op<"op_idempotent_trait",
[SameOperandsAndResultType, NoMemoryEffect, Idempotent]> {		[SameOperandsAndResultType, NoMemoryEffect, Idempotent]> {
let arguments = (ins I32:$op1);		let arguments = (ins I32:$op1);
let results = (outs I32);		let results = (outs I32);
}		}

def TestIdempotentTraitBinaryOp		def TestIdempotentTraitBinaryOp
▲ Show 20 Lines • Show All 1,543 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Commutativity Operand Sorting Pattern Bug FixesNeeds RevisionPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 551665

mlir/lib/Transforms/Utils/CommutativityUtils.cpp

mlir/test/Transforms/test-commutativity-utils.mlir

mlir/test/lib/Dialect/Test/TestOps.td

[mlir] Commutativity Operand Sorting Pattern Bug Fixes
Needs RevisionPublic