This is an archive of the discontinued LLVM Phabricator instance.

include/llvm/CodeGen/ISDOpcodes.h
180 ↗	(On Diff #85346)	Please add more documentation here. In particular, a very short form of the explanation from SelectionDAGISel::Select_FREEZE() about the difference in semantics between FREEZE and UNDEF would be appropriate.
lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2238 ↗	(On Diff #85346)	I'm a bit confused about this. The function above selects UNDEF to IMPLICIT_DEF. Where do we do the duplication? If we duplicate UNDEFs on the DAG level, then everything's already fine, and you can just select a FREEZE directly to IMPLICIT_DEF. If we're duplicating IMPLICIT_DEFs, then this is a problem regardless of what you do here, if the freeze ends up being an IMPLICIT_DEF. Or are you saying we're duplicating IMPLICIT_DEFs in the DAG, but not in MI? I'd find that surprising. (Note that the documentation for IMPLICIT_DEF just says that "this is the MachineInstr-level equivalent of undef." If we're not allowed to duplicate IMPLICIT_DEFs and they have a single defined value, that should be corrected.)

Also, I didn't see a corresponding patch for FastISel. Did I miss one?

nlopes added a subscriber: nlopes.Jan 25 2017, 4:17 AM

nlopes added inline comments.

lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2238 ↗	(On Diff #85346)	If you have e.g.: mul undef, undef this eventually may get translated into: %v1 = IMPLICIT_DEF %v2 = IMPLICIT_DEF %v3 = mul %v1, %v2 Even if it doesn't get translated to this explicitly, the semantics of UNDEF/IMPLICIT_DEF are that each read may see a different value. For example, IMPLICIT_DEF can be sank into a loop and each iteration may see a different value, and known-bits analyses will also take advantage of the fact that each bit can take any value it wants. Therefore we use this "hack" of making a copy of the vreg where IMPLICIT_DEF is assigned to. This seems to guarantee that all uses see the same value, since there isn't a pass propagating IMPLICIT_DEF to copies.

FastISel for FREEZE added.
Updated description of FREEZE.

aqjune marked an inline comment as done.Jan 26 2017, 1:29 AM

trentxintong added a subscriber: trentxintong.Jan 26 2017, 12:28 PM

mkuper added inline comments.Jan 26 2017, 2:36 PM

include/llvm/CodeGen/ISDOpcodes.h
180 ↗	(On Diff #85743)	random -> arbitrary (sorry, it's a pet peeve :-) )
lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2238 ↗	(On Diff #85346)	If I understand correctly, the freeze.ll test shows you do end up with just an IMPLICIT_DEF eventually. So I'm not sure where the benefit is? You rely on backend phase ordering to elide the copy only after the last time IMPLICIT_DEF may be duplicated? This seems a bit fishy. Anyway, this is just a drive-by, I defer to Quentin and the other if they think this is reasonable.

aqjune updated this revision to Diff 86018.Jan 26 2017, 11:23 PM

aqjune marked an inline comment as done.

regehr added a subscriber: regehr.Jan 28 2017, 10:03 PM

Rebased to svn commit 308173

Rebased to svn commit 331585

All these new llc tests - have you considered using utils/update_llc_test_checks.py ?

Rebase to trunk@369887 (August 26th, 2019)

I guess this is the next patch in the queue?
This needs rebasing, and more tests to match the langref wording.

Support floating points, pointers, aggregate types, add tests for those types

I believe vector legalization tests are missing.

Cost model handling might be missing (i would guess they should be treated as free)

This looks reasonable-ish, but i don't have much knowledge on this part, sadly.
I'm wondering if @RKSimon / @spatel / @craig.topper might want to help review this?

include/llvm/CodeGen/ISDOpcodes.h
179 ↗	(On Diff #224861)	s/integer/value/

craig.topper added inline comments.Oct 14 2019, 10:21 AM

lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
385 ↗	(On Diff #224861)	Just use ISD::FREEZE instead of N->getOpcode()
4021 ↗	(On Diff #224861)	I think this should be with the other PromoteIntOp functions? I think that's how this file is sorted.
4021 ↗	(On Diff #224861)	Does this function even get called? The input type and the output type are the same right? So PromoteIntRes should have been called first.
4025 ↗	(On Diff #224861)	Same with this. We should expand the result and then we'll never have to expand the input separately.
lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
3105 ↗	(On Diff #224861)	Would we better off just handling freeze on its own instead of adding all this aggregate code to the generic visitUnary?

(as per inline comments)

This revision now requires changes to proceed.Oct 20 2019, 4:40 AM

Remove DAGTypeLegalizer::PromoteIntOp_FREEZE, DAGTypeLegalizer::ExpandIntOp_FREEZE as they are never called
Add cost model of freeze
Add vector legalization test to freeze-legalize.ll

aqjune marked 6 inline comments as done.Oct 21 2019, 1:06 AM

aqjune added inline comments.

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
3105 ↗	(On Diff #224861)	I put it here as visitBinary() was also handling opcodes, but I have no special preference.

@qcolombet - ping as per inline comment?
I'm also not really fully sold that the SelectionDAGISel::Select_FREEZE()
approach is fully future-proof/correct.

Other than that, nothing major stands out to me here.

include/llvm/Analysis/TargetTransformInfoImpl.h
853 ↗	(On Diff #225827)	Also here And in `TargetTransformInfo::getInstructionThroughput()`
lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
4055 ↗	(On Diff #225827)	Spurious newline change
lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
3106 ↗	(On Diff #225827)	`assert(Opcode == ISD::FREEZE && "Can only get here for` freeze` nodes");`
3110 ↗	(On Diff #225827)	for (unsigned i = 0, NumOps = Op.getNumOperands(); i < NumOps; ++i)
lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2238 ↗	(On Diff #85346)	@qcolombet ping, please?

I was looking through MachineInstr's optimizations to recheck validity of using IMPLICIT_DEF, and found that it caused problems at two cases.

When a function call is involved, its value is not preserved.

define i32 @foo()
  %y1 = freeze i32 undef
  %k = add i32 %y1, 10
  call i32 @g()
  %k2 = add i32 %y1, 20
  ..

_foo:
  addl  $10, %ebx
  callq _g
  addl  $20, %eax <- this is wrong because %eax and %ebx may differ
  ..

When PHI node is involved, it is folded into an incorrect value.

  br i1 %cond, label %BB1, label %BB2
BB1:
  %y1 = freeze i32 undef
  %k1 = sub i32 %y1, 1
  br label %END
BB2:
  %y2 = freeze i32 undef
  br label %END
END:
  %p = phi i32 [%y1, %BB1], [%y2, %BB2]
  %p2 = phi i32 [%k1, %BB1], [0, %BB2]
  store i32 %p, i32* @x
  store i32 %p2, i32* @y

LBB0_3:                                 ## %END
  movl  %eax, _x(%rip)
  movl  %eax, _y(%rip) <- should be "%eax - 1" if %cond is true

But, other than these two (function call and phi), other operations seemed okay.
Arithmetic operations including add / mul didn't fold IMPLICIT_DEF in a undef-like way.

What would be a good way to address the two cases?
The latter seemed to be done by ProcessImplicitDefs.cpp / PHIElimination.cpp. The former one seems more complicated; it seems to require preservation of information about the original IMPLICIT_DEF before it is removed by later pipeline.

arsenm added a subscriber: arsenm.Oct 27 2019, 2:33 PM

arsenm added inline comments.

lib/CodeGen/SelectionDAG/FastISel.cpp
1573 ↗	(On Diff #225827)	s/unsigned/Register
lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
3109 ↗	(On Diff #225827)	1 seems a bit small for the size
lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2238 ↗	(On Diff #85346)	I see one potential problem with using a COPY of IMPLICIT_DEF. Suppose an instruction has two registers operands with different register classes. A codegen pass may reasonably try to rematerialize one of the implicit_defs to the natural operand register class to avoid the copy to satisfy operand constraints
2302 ↗	(On Diff #225827)	s/unsigned/Register/
test/CodeGen/X86/freeze.ll
89 ↗	(On Diff #225827)	I'm not sure where Phabricator gets it syntax highlighting from, but it should add the freeze keyword

aqjune added a parent revision: D69932: [IR] Redefine Freeze instruction.Nov 6 2019, 11:41 PM

Add FREEZE pseudoinstruction for MachineIR
Let ExpandPostRAPseudos expand FREEZE
Address reviewers' comments
Add tests for function call / phi
Add a comment to TargetPassConfig stating that after ExpandPostRAPseudos IMPLICIT_DEF should not be optimized to different registers/constants

Herald added a project: Restricted Project. · View Herald TranscriptNov 11 2019, 8:56 AM

Herald added subscribers: Jim, hiraditya, dylanmckay. · View Herald Transcript

I added FREEZE pseudoinstruction to MachineIR, as it seemed to be the succinct way to make it correct.
I made ExpandPostRAPseudos expand the FREEZE to register-copy assembly instruction. ExpandPostRAPseudos pass is done after register allocation (which, or at least a pass relevant to it, seems to replace uses of IMPLICIT_DEF with different registers) as well as ProcessImplicitDef (which is the culprit of the incorrect PHI optimization example) and PHIElimination. I left a commit that explicitly states that IMPLICIT_DEF cannot be used in an undef-y way after the pass.

craig.topper added inline comments.Nov 11 2019, 7:50 PM

llvm/include/llvm/Support/TargetOpcodes.def
59	Comment says IMPLICIT_DEF
llvm/lib/CodeGen/SelectionDAG/FastISel.cpp
1571	Why doesn't this TargetOpcode::Freeze?
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
3106	Separate this into just visitFreeze.

Let FastISel emit FREEZE
Let SelectionDAGBuilder::visitFreeze lower freeze IR instructions
Minor fixes

aqjune marked 3 inline comments as done.Nov 11 2019, 10:59 PM

aqjune retitled this revision from [SelDag] Implement FREEZE node to [SelDag][MIR] Add FREEZE .Nov 11 2019, 11:07 PM

craig.topper added inline comments.Nov 11 2019, 11:42 PM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
3104	Drop this change?

lkail added a subscriber: lkail.Nov 12 2019, 12:02 AM

Do you also want a MIR test that shows IR freeze -> MIR freeze lowering?

Add freeze IR -> MIR test

aqjune marked an inline comment as done.Nov 13 2019, 10:46 PM

aqjune added inline comments.

llvm/test/CodeGen/X86/freeze-mir.ll
2	I made a separate `freeze-mir.ll` file for mir test because `update_mir_test_checks.py` did not work after `freeze.ll` is processed with `update_llc_test_checks.py`

Hmm, thanks, i think this now looks good to me.
@craig.topper / @qcolombet / @arsenm ?

llvm/test/CodeGen/X86/freeze-call.ll
8–13	You can avoid cfi noise via `define i32 @foo() nounwind {`

Add nounwind to tests

arsenm added inline comments.Nov 14 2019, 8:51 PM

llvm/include/llvm/Target/Target.td
1067	Why isNotDuplicable? I don't think this is the best maintained instruction property

aqjune marked an inline comment as done.Nov 14 2019, 9:36 PM

aqjune added inline comments.

llvm/include/llvm/Target/Target.td
1067	It is because different freeze defs can yield different values. For example, consider following transformation: %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) use(%x) // these two uses should see the same freezed value -> %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) %x' = FREEZE %undef use(%x') // It is possible that %x and %x' are assigned differently freezed values. This transformation is incorrect, because use()s after the transformation can see different values. To prevent this class of optimizations, isNotDuplicable is set to 1.

arsenm added inline comments.Nov 14 2019, 9:47 PM

llvm/include/llvm/Target/Target.td
1067	But these both are using the same undef vreg? The second freeze is still using the original undef, so this should be fine?

arsenm added inline comments.Nov 14 2019, 9:53 PM

llvm/include/llvm/Target/Target.td
1067	To clarify, I think the property you are really looking for is not the property given to you by isNotDuplicable. What you really care about is the input operand not using a new instance of undef. The "nonduplicability" is a function of the input value and not the instruction itself

aqjune marked an inline comment as done.Nov 14 2019, 9:55 PM

aqjune added inline comments.

llvm/include/llvm/Target/Target.td
1067	The second use of the undef register can be assigned a different physical register at (perhaps) register allocation, after seeing that its definition is IMPLICIT_DEF. %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) %x' = FREEZE %undef use(%x') => %x = FREEZE $eax use(%x) %x' = FREEZE $ebx use(%x') This transformation can happen if there is a function call between them. `test/CodeGen/X86/freeze-call.ll` checks that it never happens.

arsenm added inline comments.Nov 14 2019, 9:59 PM

llvm/include/llvm/Target/Target.td
1067	I think handling this correctly requires changing to how IMPLICIT_DEF is handled in register allocation. I don't think noduplicate is really going to model this constraint sufficiently

aqjune marked an inline comment as not done.Nov 14 2019, 10:13 PM

aqjune added inline comments.

llvm/include/llvm/Target/Target.td
1067	It would make complexity of this patch (lowering of freeze IR instruction) easier, but would the change in register allocation need performance tests? For example, if all uses of IMPLICIT_DEF should see a same caller-save register, the register should be spilled whenever its liveness crosses a function call. Currently IMPLICIT_DEF is commented as `MachineInstr-level equivalent of undef`, and undef`may evaluate to different values as well, so I wonder whether there are passes other than regalloc that use that semantics too. isNotDuplicable's comment says class MCInstrDesc { ... /// Return true if this instruction cannot be safely /// duplicated. For example, if the instruction has a unique labels attached /// to it, duplicating it would cause multiple definition errors. bool isNotDuplicable() const { return Flags & (1ULL << MCID::NotDuplicable); } so I thought non-duplicability was a property of the freeze instruction itself.

arsenm added inline comments.Nov 14 2019, 10:40 PM

llvm/include/llvm/Target/Target.td
1067	I mean the definition of IMPLICIT_DEF needs to change to support this. I think we may actually need two different flavors of IMPLICIT_DEF. noduplicate is a stronger barrier to useful transforms, and also doesn't model the real problem here. If you started out with two separate freeze instructions reading the same IMPLICIT_DEF register, you would still see the same problem in your example.

aqjune added inline comments.Nov 14 2019, 11:36 PM

llvm/include/llvm/Target/Target.td
1067	I mean the definition of IMPLICIT_DEF needs to change to support this. I think we may actually need two different flavors of IMPLICIT_DEF. Understood. So we can have two kinds of IMPLICIT_DEFs; the first one is an instruction that can be optimized to different values whenever used. The second one has the same value whenever it is used. I think the second IMPLICIT_DEF is equivalent to FREEZE (IMPLICIT_DEF of the first kind), because freeze is an instruction that guarantees all uses of the value see the same value even though the operand is an undef-like value. If you started out with two separate freeze instructions reading the same IMPLICIT_DEF register, you would still see the same problem in your example. If the program originally contained two separated freezes, then it is okay, because the program is originally written in that way. The programmer must have written down two freezes in LLVM IR bitcode, and they are lowered into MIR. The problematic case is when a program with a single freeze is optimized to a program with several freezes. Different freezes can return different values. %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) use(%x) // these two uses see the same value with help of freeze. -> %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) %x' = FREEZE %undef use(%x') // Now %x and %x' can be assigned different values, so this optimization is wrong.

arsenm added inline comments.Nov 14 2019, 11:50 PM

llvm/include/llvm/Target/Target.td
1067	I think it's important to define the rules here clearly for MIR, and not rely on expectations from the original IR. Some legalization for example may end up inserting multiple freezes of the same input value, and that should work correctly. Multiple freezes of the same input register should produce the same value.

aqjune added inline comments.Nov 15 2019, 12:49 AM

llvm/include/llvm/Target/Target.td
1067	I agree, I'll write the rule for freeze. Regarding legalization, in which case can it be copied? The case that I was aware of was splitting a register of illegal size, which copies freeze but does not increase # of uses per definition: %x = ... // invalid type, say i50 %y = freeze %x => %x1 = ... // i32 %x2 = ... // i18 %y1 = freeze %x1 %y2 = freeze %x2

liuz added a subscriber: liuz.Nov 15 2019, 8:39 PM

Looks like this is waiting on an update.
Really happy to see this progressing.

This revision now requires changes to proceed.Nov 22 2019, 3:19 AM

I added description about FREEZE and IMPLICIT_DEF to TargetOpcodes.def.
To address the legalization issue, updated the semantics so FREEZEs return the same value if placed consecutively. Would this properly address the issue?

I'm sorry, it seems the track of this patch was lost.
We really need to get this last bit of freeze support landed.
Can anyone spot any remaining issues with the patch?

craig.topper added inline comments.Jan 8 2020, 3:36 PM

llvm/include/llvm/Support/TargetOpcodes.def
61	constraint -> constrain

The patch & semantics look good to me, but I'm not a backend expert. I'll leave the final LGTM to someone else.
It would be awesome if we could get this in for 10.0 so that we have complete support for freeze.

Rebase
Address a comment

Herald added a subscriber: Petar.Avramovic. · View Herald TranscriptJan 9 2020, 8:09 AM

Harbormaster completed remote builds in B43603: Diff 237090.Jan 9 2020, 8:12 AM

To synchronize - the remaining issue was about duplicability of the freeze instruction in MIR.

In IR, a duplicated freeze may result in a different value due to the nature of undef. However, in MIR, this may be problematic in some cases.

My suggestion is to allow freeze in MIR to yield the same value when they are consecutively arranged.
This involved clarificaton of when the value of IMPLICIT_DEF could change during execution, and the updated comments of TargetOpcodes.def reflects that.

Currently, isNotDuplicable is still set to 1 to leave it as conservative & copiers deal with freeze specially in the future.

BTW, there was a discussion about IMPLICIT_DEF as well, about whether it should return consistent value for every use vs. it can return different values per use (as IR's undef).

Currently IMPLICIT_DEF can yieid different values, and there are a few cases where that happens (such as using IMPLICIT_DEF across function calls).

Making it return consistent value will allow FREEZE to be duplicable, but it would require fixing several transformations that deal with IMPLICIT_DEF.

Before discussion about IMPLICIT_DEF is further made and it is changed to return concrete value, I'd like to suggest leaving FREEZE as semi-duplicable (as suggested by this patch), but not fully duplicable.

efriedma added inline comments.Jan 17 2020, 4:21 PM

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
1135 ↗	(On Diff #237090)	If we expand FREEZE in ExpandPostRAPseudos, how can we reach this code?

Rebase
Remove AsmPrinter::emitFreeze

Harbormaster completed remote builds in B44371: Diff 239013.Jan 19 2020, 4:00 PM

aqjune marked 2 inline comments as done.Jan 19 2020, 4:05 PM

aqjune added inline comments.

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
1135 ↗	(On Diff #237090)	It was a legacy code to see the freeze comment from assembly before updating ExpandPostRAPseudos. I removed it.

For the next step, what can I do?

If duplicability is the main concern, I can split this patch so (1) FREEZE is defined to be duplicable first, and (2) when there happens a practical case where duplication matters, the next one is created.

It will reduce diffs in TargetOpcodes.def as well because the comment update at IMPLICIT_DEF is not needed anymore.

aqjune marked 3 inline comments as done.Jan 19 2020, 4:40 PM

Rebase

Harbormaster completed remote builds in B45333: Diff 241401.Jan 30 2020, 3:51 AM

ping

lebedev.ri mentioned this in D74228: [PatternMatch] Match XOR variant of unsigned-add overflow check..Feb 16 2020, 6:56 AM

Re-ping

ping

If adding FREEZE to MachineIR causes maintenance cost, another possible solution is to lower SelDag's FREEZE(UNDEF) to IMPLICIT_DEF directly.
In this option, FREEZE only exist in SelDag, not in MIR.
Instead, now IMPLICIT_DEF in MIR should always yield the same value. This requires a few transformations regarding IMPLICIT_DEF in MIR to be fixed.
This may cause suboptimal codegen in a few cases, but I believe we have a solution for the cases, because a single UNDEF from SelDag can be always lowered into multiple IMPLICIT_DEFs (per each use) in MIR.

This seems good to me, but you want to wait for someone else more knowledgeable..
ping @arsenm @qcolombet @craig.topper

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h
65–68	This is unrelated, please feel free to commit this right away.

This revision is now accepted and ready to land.Feb 26 2020, 2:13 AM

Herald added a subscriber: wdng. · View Herald TranscriptFeb 26 2020, 2:13 AM

Rebase
Make the update in getOperationCost as a separate commit (4f71252)

aqjune marked an inline comment as done.Feb 26 2020, 10:02 AM

Harbormaster completed remote builds in B47334: Diff 246778.Feb 26 2020, 10:44 AM

aqjune mentioned this in D76483: [DivRemPairs] Freeze operands if they can be undef values.Mar 21 2020, 12:26 AM

aqjune added a child revision: D76483: [DivRemPairs] Freeze operands if they can be undef values.Mar 22 2020, 10:59 PM

Leave the SelDag patch only, MachineIR part will be made as a separate patch (as discussed at D76483)

aqjune edited the summary of this revision. (Show Details)Mar 23 2020, 12:19 AM

aqjune retitled this revision from [SelDag][MIR] Add FREEZE to [SelDag] Add FREEZE.

Harbormaster completed remote builds in B50073: Diff 251955.Mar 23 2020, 1:06 AM

spatel added inline comments.Mar 23 2020, 1:29 PM

llvm/test/CodeGen/X86/freeze-legalize.ll
9–13	Do we need to add basic simplify / constant folding for SDAG ? freeze ( Constant ) --> Constant

aqjune marked an inline comment as done.Mar 23 2020, 5:10 PM

aqjune added inline comments.

llvm/test/CodeGen/X86/freeze-legalize.ll
9–13	Yes, I agree it will be great. Do you want to make this patch contain the change as well?

aqjune marked an inline comment as done.Mar 24 2020, 5:45 AM

aqjune added inline comments.

llvm/test/CodeGen/X86/freeze-legalize.ll
9–13	Or I can land this first, and add the simplify / constant folding for SDAG. I prefer incrementally making things because this patch itself is a big change.

spatel added inline comments.Mar 24 2020, 6:10 AM

llvm/test/CodeGen/X86/freeze-legalize.ll
9–13	I prefer smaller patches too. Let's make that a follow-up. Is it correct that there is very little chance that this patch will create a visible performance regression (because there should be almost no freeze instruction creation in IR yet)?

aqjune marked an inline comment as done.Mar 24 2020, 6:56 AM

aqjune added inline comments.

llvm/test/CodeGen/X86/freeze-legalize.ll
9–13	Yes, it is. Currently there is only one place where freeze is introduced - https://reviews.llvm.org/D76179 I checked that from assembly outputs of LLVM test-suite , only 3 / 5239 files are affected by this patch.

Closed by commit rG7802be4a3d86: [SelDag] Add FREEZE (authored by aqjune). · Explain WhyMar 24 2020, 7:30 AM

This revision was automatically updated to reflect the committed changes.

spatel added inline comments.Mar 24 2020, 7:37 AM

llvm/test/CodeGen/X86/freeze-legalize.ll
9–13	That sounds good then. But can we avoid those 3 regressions cases before or within this patch? Ideally, we don't want to knowingly regress anything.

aqjune marked an inline comment as done.Mar 24 2020, 8:22 AM

aqjune added inline comments.

llvm/test/CodeGen/X86/freeze-legalize.ll
9–13	Among 3 files, two were simple regressions that had a bit more verbose assembly: ... sete %al orb %bpl, %al jne .LBB9_1 => ... sete %al orb %bpl, %al testb $1, %al jne .LBB9_1 ... testb $1, %al je .LBB0_2 => ... movl %eax, %ecx andl $1, %ecx je .LBB0_2 Case 3's assembly diff was bigger, so needs inspection. I can visit it after the simpler two cases are resolved.

spatel mentioned this in D76707: [DAGCombine] Add basic optimizations for FREEZE in SelDag.Mar 24 2020, 9:01 AM

@aqjune

@bkramer committed a fix for a crash in LegalizeFloatTypes where this operator appeared in SoftPromoteHalfResult

Our downstream ARM compiler is crashing in regressions in a similar manner in SoftenFloatResult in the same file. Is it not conceivable that ISD::FREEZE might make it to this function? If that's the case, we can look into it and start up a new review if necessary. The test causing this crash is select-cc.ll.

AbigailLinden added a subscriber: AbigailLinden.Mar 27 2020, 2:08 PM

In D29014#1946763, @JamesNagurne wrote:

@aqjune

@bkramer committed a fix for a crash in LegalizeFloatTypes where this operator appeared in SoftPromoteHalfResult

Our downstream ARM compiler is crashing in regressions in a similar manner in SoftenFloatResult in the same file. Is it not conceivable that ISD::FREEZE might make it to this function? If that's the case, we can look into it and start up a new review if necessary. The test causing this crash is select-cc.ll.

Hi, I saw @bkramer 's fix too (commit id 0019c2f194a5e1f4cd65c5284e204328cc40ab3d ), and I think it is a valid fix. Similar thing can be applied to the SoftenFloatResult function. I'll make a patch for this.

aqjune added a child revision: D76980: [LegalizeTypes] Add SoftenFloatRes_FREEZE.Mar 28 2020, 1:42 AM

ZhangKang mentioned this in rGa8e15ee04a7f: [CodeGen] Support freeze expand for ppc_fp128.Apr 20 2020, 1:02 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

TargetTransformInfoImpl.h

5 lines

CodeGen/

FastISel.h

1 line

ISDOpcodes.h

5 lines

SelectionDAGISel.h

2 lines

Support/

TargetOpcodes.def

51 lines

Target/

Target.td

9 lines

lib/

CodeGen/

ExpandPostRAPseudos.cpp

3 lines

SelectionDAG/

FastISel.cpp

24 lines

LegalizeIntegerTypes.cpp

11 lines

LegalizeTypes.h

2 lines

LegalizeTypesGeneric.cpp

9 lines

LegalizeVectorTypes.cpp

3 lines

SelectionDAGBuilder.cpp

20 lines

SelectionDAGDumper.cpp

1 line

SelectionDAGISel.cpp

8 lines

TargetLoweringBase.cpp

2 lines

TargetPassConfig.cpp

10 lines

Target/

AVR/

AVRInstrInfo.cpp

1 line

test/

CodeGen/

X86/

15 lines

5 lines

35 lines

93 lines

120 lines

59 lines

104 lines

Diff 239013

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	case Instruction::BitCast:
assert(OpTy && "Cast instructions must provide the operand type");		assert(OpTy && "Cast instructions must provide the operand type");
if (Ty == OpTy \|\| (Ty->isPointerTy() && OpTy->isPointerTy()))		if (Ty == OpTy \|\| (Ty->isPointerTy() && OpTy->isPointerTy()))
// Identity and pointer-to-pointer casts are free.		// Identity and pointer-to-pointer casts are free.
return TTI::TCC_Free;		return TTI::TCC_Free;

// Otherwise, the default basic cost is used.		// Otherwise, the default basic cost is used.
return TTI::TCC_Basic;		return TTI::TCC_Basic;

		case Instruction::Freeze:
		// Freeze operation is free because it should be lowered into a register
		// use without any register copy in assembly code.
		return TTI::TCC_Free;
		lebedev.riUnsubmitted Done Reply Inline Actions This is unrelated, please feel free to commit this right away. lebedev.ri: This is unrelated, please feel free to commit this right away.

case Instruction::FDiv:		case Instruction::FDiv:
case Instruction::FRem:		case Instruction::FRem:
case Instruction::SDiv:		case Instruction::SDiv:
case Instruction::SRem:		case Instruction::SRem:
case Instruction::UDiv:		case Instruction::UDiv:
case Instruction::URem:		case Instruction::URem:
return TTI::TCC_Expensive;		return TTI::TCC_Expensive;

▲ Show 20 Lines • Show All 865 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/FastISel.h

Show First 20 Lines • Show All 528 Lines • ▼ Show 20 Lines	protected:
bool selectBinaryOp(const User *I, unsigned ISDOpcode);		bool selectBinaryOp(const User *I, unsigned ISDOpcode);
bool selectFNeg(const User I, const Value In);		bool selectFNeg(const User I, const Value In);
bool selectGetElementPtr(const User *I);		bool selectGetElementPtr(const User *I);
bool selectStackmap(const CallInst *I);		bool selectStackmap(const CallInst *I);
bool selectPatchpoint(const CallInst *I);		bool selectPatchpoint(const CallInst *I);
bool selectCall(const User *I);		bool selectCall(const User *I);
bool selectIntrinsicCall(const IntrinsicInst *II);		bool selectIntrinsicCall(const IntrinsicInst *II);
bool selectBitCast(const User *I);		bool selectBitCast(const User *I);
		bool selectFreeze(const User *I);
bool selectCast(const User *I, unsigned Opcode);		bool selectCast(const User *I, unsigned Opcode);
bool selectExtractValue(const User *U);		bool selectExtractValue(const User *U);
bool selectInsertValue(const User *I);		bool selectInsertValue(const User *I);
bool selectXRayCustomEvent(const CallInst *II);		bool selectXRayCustomEvent(const CallInst *II);
bool selectXRayTypedEvent(const CallInst *II);		bool selectXRayTypedEvent(const CallInst *II);

bool shouldOptForSize(const MachineFunction *MF) const {		bool shouldOptForSize(const MachineFunction *MF) const {
// TODO: Implement PGSO.		// TODO: Implement PGSO.
▲ Show 20 Lines • Show All 59 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/ISDOpcodes.h

Show First 20 Lines • Show All 172 Lines • ▼ Show 20 Lines	enum NodeType {
/// CopyFromReg - This node indicates that the input value is a virtual or		/// CopyFromReg - This node indicates that the input value is a virtual or
/// physical register that is defined outside of the scope of this		/// physical register that is defined outside of the scope of this
/// SelectionDAG. The register is available from the RegisterSDNode object.		/// SelectionDAG. The register is available from the RegisterSDNode object.
CopyFromReg,		CopyFromReg,

/// UNDEF - An undefined node.		/// UNDEF - An undefined node.
UNDEF,		UNDEF,

		// FREEZE - FREEZE(VAL) returns an arbitrary value if VAL is UNDEF (or
		// is evaluated to UNDEF), or returns VAL otherwise. Note that each
		// read of UNDEF can yield different value, but FREEZE(UNDEF) cannot.
		FREEZE,

/// EXTRACT_ELEMENT - This is used to get the lower or upper (determined by		/// EXTRACT_ELEMENT - This is used to get the lower or upper (determined by
/// a Constant, which is required to be operand #1) half of the integer or		/// a Constant, which is required to be operand #1) half of the integer or
/// float value specified as operand #0. This is only for use before		/// float value specified as operand #0. This is only for use before
/// legalization, for values that will be broken into multiple registers.		/// legalization, for values that will be broken into multiple registers.
EXTRACT_ELEMENT,		EXTRACT_ELEMENT,

/// BUILD_PAIR - This is the opposite of EXTRACT_ELEMENT in some ways.		/// BUILD_PAIR - This is the opposite of EXTRACT_ELEMENT in some ways.
/// Given two values of the same integer value type, this produces a value		/// Given two values of the same integer value type, this produces a value
▲ Show 20 Lines • Show All 948 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/SelectionDAGISel.h

Show First 20 Lines • Show All 318 Lines • ▼ Show 20 Lines	private:

// Calls to these functions are generated by tblgen.		// Calls to these functions are generated by tblgen.
void Select_INLINEASM(SDNode *N, bool Branch);		void Select_INLINEASM(SDNode *N, bool Branch);
void Select_READ_REGISTER(SDNode *Op);		void Select_READ_REGISTER(SDNode *Op);
void Select_WRITE_REGISTER(SDNode *Op);		void Select_WRITE_REGISTER(SDNode *Op);
void Select_UNDEF(SDNode *N);		void Select_UNDEF(SDNode *N);
void CannotYetSelect(SDNode *N);		void CannotYetSelect(SDNode *N);

		void Select_FREEZE(SDNode *N);

private:		private:
void DoInstructionSelection();		void DoInstructionSelection();
SDNode MorphNode(SDNode Node, unsigned TargetOpc, SDVTList VTList,		SDNode MorphNode(SDNode Node, unsigned TargetOpc, SDVTList VTList,
ArrayRef<SDValue> Ops, unsigned EmitNodeInfo);		ArrayRef<SDValue> Ops, unsigned EmitNodeInfo);

SDNode MutateStrictFPToFP(SDNode Node, unsigned NewOpc);		SDNode MutateStrictFPToFP(SDNode Node, unsigned NewOpc);

/// Prepares the landing pad to take incoming values or do other EH		/// Prepares the landing pad to take incoming values or do other EH
▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/include/llvm/Support/TargetOpcodes.def

	Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
	/// INSERT_SUBREG - This instruction takes three operands: a register that			/// INSERT_SUBREG - This instruction takes three operands: a register that
	/// has subregisters, a register providing an insert value, and a			/// has subregisters, a register providing an insert value, and a
	/// subregister index. It returns the value of the first register with the			/// subregister index. It returns the value of the first register with the
	/// value of the second register inserted. The first register is often			/// value of the second register inserted. The first register is often
	/// defined by an IMPLICIT_DEF, because it is commonly used to implement			/// defined by an IMPLICIT_DEF, because it is commonly used to implement
	/// anyext operations on target architectures which support it.			/// anyext operations on target architectures which support it.
	HANDLE_TARGET_OPCODE(INSERT_SUBREG)			HANDLE_TARGET_OPCODE(INSERT_SUBREG)

	/// IMPLICIT_DEF - This is the MachineInstr-level equivalent of undef.			/// IMPLICIT_DEF - This instruction simulates LLVM IR's undef/poison value by
				/// creating a register that contains a value of an arbitrary bit pattern.
				/// The register's value can be changed by any instruction that is executed
				/// after IMPLICIT_DEF, except FREEZE. This implies that different instructions
				craig.topperUnsubmitted Done Reply Inline Actions Comment says IMPLICIT_DEF craig.topper: Comment says IMPLICIT_DEF
				/// using the IMPLICIT_DEF can see different values.
				/// To constrain different instructions to see the same value of this register,
				craig.topperUnsubmitted Done Reply Inline Actions constraint -> constrain craig.topper: constraint -> constrain
				/// FREEZE operation can be used.
				/// %1 = IMPLICIT_DEF
				/// read(%1) ; may have implicitly changed the value of register %1
				/// read(%1) ; can read a different value
				/// %2 = FREEZE %1
				/// read(%2)
				/// read(%2) ; these two read the same value from register %2.
				///
				/// Some instructions may have IMPLICIT_DEF-like output register, if inputs are
				/// IMPLICIT_DEF. IMPLICIT_DEF-like register means it works exactly as
				/// IMPLICIT_DEF.
				/// - COPY and COPY_TO_REGCLASS of IMPLICIT_DEF has IMPLICIT_DEF-like output
				/// register, so its value can change by execution of its following
				/// instructions. This allows optimizing COPY(IMPLICIT_DEF) to IMPLICIT_DEF.
				/// - PHI having an IMPLICIT_DEF operand has IMPLICIT_DEF-like output register
				/// if the previous block was from the corresponding basic block. This allows
				/// optimizing PHI(IMPLICIT_DEF, ..., IMPLICIT_DEF) to IMPLICIT_DEF.
				/// - A register can have IMPLICIT_DEF subregisters via REG_SEQUENCE or
				/// INSERT_SUBREG. If EXTRACT_SUBREG extracts one of such subregisters, the
				/// output register is also IMPLICIT_DEF-like.
				///
				/// Except these operations and IMPLICIT_DEF, all other instructions' output
				/// registers behave as expected.
	HANDLE_TARGET_OPCODE(IMPLICIT_DEF)			HANDLE_TARGET_OPCODE(IMPLICIT_DEF)

				/// FREEZE - This is the MachineInstr-level equivalent of freeze. It copies the
				/// value of the register operand, but unlike IMPLICIT_DEF, the output
				/// register's value is preserved over the execution of following instructions.
				/// Note that COPY(IMPLICIT_DEF) is different from FREEZE(IMPLICIT_DEF), because
				/// COPY's output register works as IMPLICIT_DEF register again (see the
				/// description of IMPLICIT_DEF).
				/// %1 = IMPLICIT_DEF
				/// %2 = FREEZE %1
				/// read(%2)
				/// read(%2) ; these two read the same value.
				///
				/// Unlike other instructions, FREEZE does not change the value of IMPLICIT_DEF
				/// registers, meaning that consecutive FREEZEs on the same operand yield the
				/// same value. If they are separated, they might return different values.
				/// %1 = IMPLICIT_DEF
				/// %2 = FREEZE %1
				/// read(%2) ; this may have changed the value of %1.
				/// %3 = FREEZE %1 ; %3 and %2 may have different values.
				/// %4 = FREEZE %1 ; %3 and %4 has the same value.
				HANDLE_TARGET_OPCODE(FREEZE)

	/// SUBREG_TO_REG - Assert the value of bits in a super register.			/// SUBREG_TO_REG - Assert the value of bits in a super register.
	/// The result of this instruction is the value of the second operand inserted			/// The result of this instruction is the value of the second operand inserted
	/// into the subregister specified by the third operand. All other bits are			/// into the subregister specified by the third operand. All other bits are
	/// assumed to be equal to the bits in the immediate integer constant in the			/// assumed to be equal to the bits in the immediate integer constant in the
	/// first operand. This instruction just communicates information; No code			/// first operand. This instruction just communicates information; No code
	/// should be generated.			/// should be generated.
	/// This is typically used after an instruction where the write to a subregister			/// This is typically used after an instruction where the write to a subregister
	/// implicitly cleared the bits in the super registers.			/// implicitly cleared the bits in the super registers.
	▲ Show 20 Lines • Show All 564 Lines • Show Last 20 Lines

llvm/include/llvm/Target/Target.td

	Show First 20 Lines • Show All 1,052 Lines • ▼ Show 20 Lines
	def IMPLICIT_DEF : StandardPseudoInstruction {			def IMPLICIT_DEF : StandardPseudoInstruction {
	let OutOperandList = (outs unknown:$dst);			let OutOperandList = (outs unknown:$dst);
	let InOperandList = (ins);			let InOperandList = (ins);
	let AsmString = "";			let AsmString = "";
	let hasSideEffects = 0;			let hasSideEffects = 0;
	let isReMaterializable = 1;			let isReMaterializable = 1;
	let isAsCheapAsAMove = 1;			let isAsCheapAsAMove = 1;
	}			}
				def FREEZE : StandardPseudoInstruction {
				let OutOperandList = (outs unknown:$dst);
				let InOperandList = (ins unknown:$src);
				let AsmString = "FREEZE";
				let hasSideEffects = 0;
				let isAsCheapAsAMove = 1;
				let hasNoSchedulingInfo = 1;
				arsenmUnsubmitted Not Done Reply Inline Actions Why isNotDuplicable? I don't think this is the best maintained instruction property arsenm: Why isNotDuplicable? I don't think this is the best maintained instruction property
				aqjuneAuthorUnsubmitted Done Reply Inline Actions It is because different freeze defs can yield different values. For example, consider following transformation: %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) use(%x) // these two uses should see the same freezed value -> %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) %x' = FREEZE %undef use(%x') // It is possible that %x and %x' are assigned differently freezed values. This transformation is incorrect, because use()s after the transformation can see different values. To prevent this class of optimizations, isNotDuplicable is set to 1. aqjune: It is because different freeze defs can yield different values. For example, consider following…
				arsenmUnsubmitted Not Done Reply Inline Actions But these both are using the same undef vreg? The second freeze is still using the original undef, so this should be fine? arsenm: But these both are using the same undef vreg? The second freeze is still using the original…
				arsenmUnsubmitted Not Done Reply Inline Actions To clarify, I think the property you are really looking for is not the property given to you by isNotDuplicable. What you really care about is the input operand not using a new instance of undef. The "nonduplicability" is a function of the input value and not the instruction itself arsenm: To clarify, I think the property you are really looking for is not the property given to you by…
				aqjuneAuthorUnsubmitted Not Done Reply Inline Actions The second use of the undef register can be assigned a different physical register at (perhaps) register allocation, after seeing that its definition is IMPLICIT_DEF. %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) %x' = FREEZE %undef use(%x') => %x = FREEZE $eax use(%x) %x' = FREEZE $ebx use(%x') This transformation can happen if there is a function call between them. `test/CodeGen/X86/freeze-call.ll` checks that it never happens. aqjune: The second use of the undef register can be assigned a different physical register at (perhaps)…
				arsenmUnsubmitted Not Done Reply Inline Actions I think handling this correctly requires changing to how IMPLICIT_DEF is handled in register allocation. I don't think noduplicate is really going to model this constraint sufficiently arsenm: I think handling this correctly requires changing to how IMPLICIT_DEF is handled in register…
				aqjuneAuthorUnsubmitted Not Done Reply Inline Actions It would make complexity of this patch (lowering of freeze IR instruction) easier, but would the change in register allocation need performance tests? For example, if all uses of IMPLICIT_DEF should see a same caller-save register, the register should be spilled whenever its liveness crosses a function call. Currently IMPLICIT_DEF is commented as `MachineInstr-level equivalent of undef`, and undef`may evaluate to different values as well, so I wonder whether there are passes other than regalloc that use that semantics too. isNotDuplicable's comment says class MCInstrDesc { ... /// Return true if this instruction cannot be safely /// duplicated. For example, if the instruction has a unique labels attached /// to it, duplicating it would cause multiple definition errors. bool isNotDuplicable() const { return Flags & (1ULL << MCID::NotDuplicable); } so I thought non-duplicability was a property of the freeze instruction itself. aqjune: It would make complexity of this patch (lowering of freeze IR instruction) easier, but would…
				arsenmUnsubmitted Not Done Reply Inline Actions I mean the definition of IMPLICIT_DEF needs to change to support this. I think we may actually need two different flavors of IMPLICIT_DEF. noduplicate is a stronger barrier to useful transforms, and also doesn't model the real problem here. If you started out with two separate freeze instructions reading the same IMPLICIT_DEF register, you would still see the same problem in your example. arsenm: I mean the definition of IMPLICIT_DEF needs to change to support this. I think we may actually…
				aqjuneAuthorUnsubmitted Not Done Reply Inline Actions I mean the definition of IMPLICIT_DEF needs to change to support this. I think we may actually need two different flavors of IMPLICIT_DEF. Understood. So we can have two kinds of IMPLICIT_DEFs; the first one is an instruction that can be optimized to different values whenever used. The second one has the same value whenever it is used. I think the second IMPLICIT_DEF is equivalent to FREEZE (IMPLICIT_DEF of the first kind), because freeze is an instruction that guarantees all uses of the value see the same value even though the operand is an undef-like value. If you started out with two separate freeze instructions reading the same IMPLICIT_DEF register, you would still see the same problem in your example. If the program originally contained two separated freezes, then it is okay, because the program is originally written in that way. The programmer must have written down two freezes in LLVM IR bitcode, and they are lowered into MIR. The problematic case is when a program with a single freeze is optimized to a program with several freezes. Different freezes can return different values. %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) use(%x) // these two uses see the same value with help of freeze. -> %undef = IMPLICIT_DEF %x = FREEZE %undef use(%x) %x' = FREEZE %undef use(%x') // Now %x and %x' can be assigned different values, so this optimization is wrong. aqjune: > I mean the definition of IMPLICIT_DEF needs to change to support this. I think we may…
				arsenmUnsubmitted Not Done Reply Inline Actions I think it's important to define the rules here clearly for MIR, and not rely on expectations from the original IR. Some legalization for example may end up inserting multiple freezes of the same input value, and that should work correctly. Multiple freezes of the same input register should produce the same value. arsenm: I think it's important to define the rules here clearly for MIR, and not rely on expectations…
				aqjuneAuthorUnsubmitted Not Done Reply Inline Actions I agree, I'll write the rule for freeze. Regarding legalization, in which case can it be copied? The case that I was aware of was splitting a register of illegal size, which copies freeze but does not increase # of uses per definition: %x = ... // invalid type, say i50 %y = freeze %x => %x1 = ... // i32 %x2 = ... // i18 %y1 = freeze %x1 %y2 = freeze %x2 aqjune: I agree, I'll write the rule for freeze. Regarding legalization, in which case can it be copied?
				let isNotDuplicable = 1;
				}
	def SUBREG_TO_REG : StandardPseudoInstruction {			def SUBREG_TO_REG : StandardPseudoInstruction {
	let OutOperandList = (outs unknown:$dst);			let OutOperandList = (outs unknown:$dst);
	let InOperandList = (ins unknown:$implsrc, unknown:$subsrc, i32imm:$subidx);			let InOperandList = (ins unknown:$implsrc, unknown:$subsrc, i32imm:$subidx);
	let AsmString = "";			let AsmString = "";
	let hasSideEffects = 0;			let hasSideEffects = 0;
	}			}
	def COPY_TO_REGCLASS : StandardPseudoInstruction {			def COPY_TO_REGCLASS : StandardPseudoInstruction {
	let OutOperandList = (outs unknown:$dst);			let OutOperandList = (outs unknown:$dst);
	▲ Show 20 Lines • Show All 560 Lines • Show Last 20 Lines

llvm/lib/CodeGen/ExpandPostRAPseudos.cpp

//===-- ExpandPostRAPseudos.cpp - Pseudo instruction expansion pass -------===//		//===-- ExpandPostRAPseudos.cpp - Pseudo instruction expansion pass -------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file defines a pass that expands COPY and SUBREG_TO_REG pseudo		// This file defines a pass that expands COPY, SUBREG_TO_REG, and FREEZE pseudo
// instructions after register allocation.		// instructions after register allocation.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/CodeGen/MachineFunctionPass.h"		#include "llvm/CodeGen/MachineFunctionPass.h"
#include "llvm/CodeGen/MachineInstr.h"		#include "llvm/CodeGen/MachineInstr.h"
#include "llvm/CodeGen/MachineInstrBuilder.h"		#include "llvm/CodeGen/MachineInstrBuilder.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
▲ Show 20 Lines • Show All 188 Lines • ▼ Show 20 Lines	for (MachineBasicBlock::iterator mi = mbbi->begin(), me = mbbi->end();
continue;		continue;
}		}

// Expand standard pseudos.		// Expand standard pseudos.
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
case TargetOpcode::SUBREG_TO_REG:		case TargetOpcode::SUBREG_TO_REG:
MadeChange \|= LowerSubregToReg(&MI);		MadeChange \|= LowerSubregToReg(&MI);
break;		break;
		case TargetOpcode::FREEZE:
case TargetOpcode::COPY:		case TargetOpcode::COPY:
MadeChange \|= LowerCopy(&MI);		MadeChange \|= LowerCopy(&MI);
break;		break;
case TargetOpcode::DBG_VALUE:		case TargetOpcode::DBG_VALUE:
continue;		continue;
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
case TargetOpcode::EXTRACT_SUBREG:		case TargetOpcode::EXTRACT_SUBREG:
llvm_unreachable("Sub-register pseudos should have been eliminated.");		llvm_unreachable("Sub-register pseudos should have been eliminated.");
}		}
}		}
}		}

return MadeChange;		return MadeChange;
}		}

llvm/lib/CodeGen/SelectionDAG/FastISel.cpp

Show First 20 Lines • Show All 1,562 Lines • ▼ Show 20 Lines	bool FastISel::selectBitCast(const User *I) {
// If the reg-reg copy failed, select a BITCAST opcode.		// If the reg-reg copy failed, select a BITCAST opcode.
if (!ResultReg)		if (!ResultReg)
ResultReg = fastEmit_r(SrcVT, DstVT, ISD::BITCAST, Op0, Op0IsKill);		ResultReg = fastEmit_r(SrcVT, DstVT, ISD::BITCAST, Op0, Op0IsKill);

if (!ResultReg)		if (!ResultReg)
return false;		return false;

updateValueMap(I, ResultReg);		updateValueMap(I, ResultReg);
return true;		return true;
		craig.topperUnsubmitted Done Reply Inline Actions Why doesn't this TargetOpcode::Freeze? craig.topper: Why doesn't this TargetOpcode::Freeze?
}		}

		bool FastISel::selectFreeze(const User *I) {
		Register Reg = getRegForValue(I->getOperand(0));
		if (!Reg)
		// Unhandled operand.
		return false;

		EVT ETy = TLI.getValueType(DL, I->getOperand(0)->getType());
		if (ETy == MVT::Other \|\| !TLI.isTypeLegal(ETy))
		// Unhandled type, bail out.
		return false;

		MVT Ty = ETy.getSimpleVT();
		const TargetRegisterClass *TyRegClass = TLI.getRegClassFor(Ty);
		Register ResultReg = createResultReg(TyRegClass);
		BuildMI(*FuncInfo.MBB, FuncInfo.InsertPt, DbgLoc,
		TII.get(TargetOpcode::FREEZE), ResultReg).addReg(Reg);

		updateValueMap(I, ResultReg);
		return true;
		}

// Remove local value instructions starting from the instruction after		// Remove local value instructions starting from the instruction after
// SavedLastLocalValue to the current function insert point.		// SavedLastLocalValue to the current function insert point.
void FastISel::removeDeadLocalValueCode(MachineInstr *SavedLastLocalValue)		void FastISel::removeDeadLocalValueCode(MachineInstr *SavedLastLocalValue)
{		{
MachineInstr *CurLastLocalValue = getLastLocalValue();		MachineInstr *CurLastLocalValue = getLastLocalValue();
if (CurLastLocalValue != SavedLastLocalValue) {		if (CurLastLocalValue != SavedLastLocalValue) {
// Find the first local value instruction to be deleted.		// Find the first local value instruction to be deleted.
// This is the instruction after SavedLastLocalValue if it is non-NULL.		// This is the instruction after SavedLastLocalValue if it is non-NULL.
▲ Show 20 Lines • Show All 325 Lines • ▼ Show 20 Lines	if (!Reg)
return false;		return false;
updateValueMap(I, Reg);		updateValueMap(I, Reg);
return true;		return true;
}		}

case Instruction::ExtractValue:		case Instruction::ExtractValue:
return selectExtractValue(I);		return selectExtractValue(I);

		case Instruction::Freeze:
		return selectFreeze(I);

case Instruction::PHI:		case Instruction::PHI:
llvm_unreachable("FastISel shouldn't visit PHI nodes!");		llvm_unreachable("FastISel shouldn't visit PHI nodes!");

default:		default:
// Unhandled instruction. Halt "fast" selection and bail.		// Unhandled instruction. Halt "fast" selection and bail.
return false;		return false;
}		}
}		}
▲ Show 20 Lines • Show All 557 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

Show First 20 Lines • Show All 192 Lines • ▼ Show 20 Lines	#endif
case ISD::VECREDUCE_OR:		case ISD::VECREDUCE_OR:
case ISD::VECREDUCE_XOR:		case ISD::VECREDUCE_XOR:
case ISD::VECREDUCE_SMAX:		case ISD::VECREDUCE_SMAX:
case ISD::VECREDUCE_SMIN:		case ISD::VECREDUCE_SMIN:
case ISD::VECREDUCE_UMAX:		case ISD::VECREDUCE_UMAX:
case ISD::VECREDUCE_UMIN:		case ISD::VECREDUCE_UMIN:
Res = PromoteIntRes_VECREDUCE(N);		Res = PromoteIntRes_VECREDUCE(N);
break;		break;

		case ISD::FREEZE:
		Res = PromoteIntRes_FREEZE(N);
		break;
}		}

// If the result is null then the sub-method took care of registering it.		// If the result is null then the sub-method took care of registering it.
if (Res.getNode())		if (Res.getNode())
SetPromotedInteger(SDValue(N, ResNo), Res);		SetPromotedInteger(SDValue(N, ResNo), Res);
}		}

SDValue DAGTypeLegalizer::PromoteIntRes_MERGE_VALUES(SDNode *N,		SDValue DAGTypeLegalizer::PromoteIntRes_MERGE_VALUES(SDNode *N,
▲ Show 20 Lines • Show All 181 Lines • ▼ Show 20 Lines	static EVT getShiftAmountTyForConstant(EVT VT, const TargetLowering &TLI,
// If any possible shift value won't fit in the prefered type, just use		// If any possible shift value won't fit in the prefered type, just use
// something safe. It will be legalized when the shift is expanded.		// something safe. It will be legalized when the shift is expanded.
if (!ShiftVT.isVector() &&		if (!ShiftVT.isVector() &&
ShiftVT.getSizeInBits() < Log2_32_Ceil(VT.getSizeInBits()))		ShiftVT.getSizeInBits() < Log2_32_Ceil(VT.getSizeInBits()))
ShiftVT = MVT::i32;		ShiftVT = MVT::i32;
return ShiftVT;		return ShiftVT;
}		}

		SDValue DAGTypeLegalizer::PromoteIntRes_FREEZE(SDNode *N) {
		SDValue V = GetPromotedInteger(N->getOperand(0));
		return DAG.getNode(ISD::FREEZE, SDLoc(N),
		V.getValueType(), V);
		}

SDValue DAGTypeLegalizer::PromoteIntRes_BSWAP(SDNode *N) {		SDValue DAGTypeLegalizer::PromoteIntRes_BSWAP(SDNode *N) {
SDValue Op = GetPromotedInteger(N->getOperand(0));		SDValue Op = GetPromotedInteger(N->getOperand(0));
EVT OVT = N->getValueType(0);		EVT OVT = N->getValueType(0);
EVT NVT = Op.getValueType();		EVT NVT = Op.getValueType();
SDLoc dl(N);		SDLoc dl(N);

unsigned DiffBits = NVT.getScalarSizeInBits() - OVT.getScalarSizeInBits();		unsigned DiffBits = NVT.getScalarSizeInBits() - OVT.getScalarSizeInBits();
EVT ShiftVT = getShiftAmountTyForConstant(NVT, TLI, DAG);		EVT ShiftVT = getShiftAmountTyForConstant(NVT, TLI, DAG);
▲ Show 20 Lines • Show All 1,373 Lines • ▼ Show 20 Lines
#endif		#endif
report_fatal_error("Do not know how to expand the result of this "		report_fatal_error("Do not know how to expand the result of this "
"operator!");		"operator!");

case ISD::MERGE_VALUES: SplitRes_MERGE_VALUES(N, ResNo, Lo, Hi); break;		case ISD::MERGE_VALUES: SplitRes_MERGE_VALUES(N, ResNo, Lo, Hi); break;
case ISD::SELECT: SplitRes_SELECT(N, Lo, Hi); break;		case ISD::SELECT: SplitRes_SELECT(N, Lo, Hi); break;
case ISD::SELECT_CC: SplitRes_SELECT_CC(N, Lo, Hi); break;		case ISD::SELECT_CC: SplitRes_SELECT_CC(N, Lo, Hi); break;
case ISD::UNDEF: SplitRes_UNDEF(N, Lo, Hi); break;		case ISD::UNDEF: SplitRes_UNDEF(N, Lo, Hi); break;
		case ISD::FREEZE: SplitRes_FREEZE(N, Lo, Hi); break;

case ISD::BITCAST: ExpandRes_BITCAST(N, Lo, Hi); break;		case ISD::BITCAST: ExpandRes_BITCAST(N, Lo, Hi); break;
case ISD::BUILD_PAIR: ExpandRes_BUILD_PAIR(N, Lo, Hi); break;		case ISD::BUILD_PAIR: ExpandRes_BUILD_PAIR(N, Lo, Hi); break;
case ISD::EXTRACT_ELEMENT: ExpandRes_EXTRACT_ELEMENT(N, Lo, Hi); break;		case ISD::EXTRACT_ELEMENT: ExpandRes_EXTRACT_ELEMENT(N, Lo, Hi); break;
case ISD::EXTRACT_VECTOR_ELT: ExpandRes_EXTRACT_VECTOR_ELT(N, Lo, Hi); break;		case ISD::EXTRACT_VECTOR_ELT: ExpandRes_EXTRACT_VECTOR_ELT(N, Lo, Hi); break;
case ISD::VAARG: ExpandRes_VAARG(N, Lo, Hi); break;		case ISD::VAARG: ExpandRes_VAARG(N, Lo, Hi); break;

case ISD::ANY_EXTEND: ExpandIntRes_ANY_EXTEND(N, Lo, Hi); break;		case ISD::ANY_EXTEND: ExpandIntRes_ANY_EXTEND(N, Lo, Hi); break;
▲ Show 20 Lines • Show All 2,643 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

Show First 20 Lines • Show All 298 Lines • ▼ Show 20 Lines	private:
SDValue PromoteIntRes_BUILD_PAIR(SDNode *N);		SDValue PromoteIntRes_BUILD_PAIR(SDNode *N);
SDValue PromoteIntRes_Constant(SDNode *N);		SDValue PromoteIntRes_Constant(SDNode *N);
SDValue PromoteIntRes_CTLZ(SDNode *N);		SDValue PromoteIntRes_CTLZ(SDNode *N);
SDValue PromoteIntRes_CTPOP(SDNode *N);		SDValue PromoteIntRes_CTPOP(SDNode *N);
SDValue PromoteIntRes_CTTZ(SDNode *N);		SDValue PromoteIntRes_CTTZ(SDNode *N);
SDValue PromoteIntRes_EXTRACT_VECTOR_ELT(SDNode *N);		SDValue PromoteIntRes_EXTRACT_VECTOR_ELT(SDNode *N);
SDValue PromoteIntRes_FP_TO_XINT(SDNode *N);		SDValue PromoteIntRes_FP_TO_XINT(SDNode *N);
SDValue PromoteIntRes_FP_TO_FP16(SDNode *N);		SDValue PromoteIntRes_FP_TO_FP16(SDNode *N);
		SDValue PromoteIntRes_FREEZE(SDNode *N);
SDValue PromoteIntRes_INT_EXTEND(SDNode *N);		SDValue PromoteIntRes_INT_EXTEND(SDNode *N);
SDValue PromoteIntRes_LOAD(LoadSDNode *N);		SDValue PromoteIntRes_LOAD(LoadSDNode *N);
SDValue PromoteIntRes_MLOAD(MaskedLoadSDNode *N);		SDValue PromoteIntRes_MLOAD(MaskedLoadSDNode *N);
SDValue PromoteIntRes_MGATHER(MaskedGatherSDNode *N);		SDValue PromoteIntRes_MGATHER(MaskedGatherSDNode *N);
SDValue PromoteIntRes_Overflow(SDNode *N);		SDValue PromoteIntRes_Overflow(SDNode *N);
SDValue PromoteIntRes_SADDSUBO(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_SADDSUBO(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_SELECT(SDNode *N);		SDValue PromoteIntRes_SELECT(SDNode *N);
SDValue PromoteIntRes_VSELECT(SDNode *N);		SDValue PromoteIntRes_VSELECT(SDNode *N);
▲ Show 20 Lines • Show All 598 Lines • ▼ Show 20 Lines	private:
void GetPairElements(SDValue Pair, SDValue &Lo, SDValue &Hi);		void GetPairElements(SDValue Pair, SDValue &Lo, SDValue &Hi);

// Generic Result Splitting.		// Generic Result Splitting.
void SplitRes_MERGE_VALUES(SDNode *N, unsigned ResNo,		void SplitRes_MERGE_VALUES(SDNode *N, unsigned ResNo,
SDValue &Lo, SDValue &Hi);		SDValue &Lo, SDValue &Hi);
void SplitRes_SELECT (SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitRes_SELECT (SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitRes_SELECT_CC (SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitRes_SELECT_CC (SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitRes_UNDEF (SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitRes_UNDEF (SDNode *N, SDValue &Lo, SDValue &Hi);
		void SplitRes_FREEZE (SDNode *N, SDValue &Lo, SDValue &Hi);

void SplitVSETCC(const SDNode *N);		void SplitVSETCC(const SDNode *N);

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Generic Expansion: LegalizeTypesGeneric.cpp		// Generic Expansion: LegalizeTypesGeneric.cpp
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

// Legalization methods which only use that the illegal type is split into two		// Legalization methods which only use that the illegal type is split into two
Show All 39 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeTypesGeneric.cpp

	Show First 20 Lines • Show All 551 Lines • ▼ Show 20 Lines
	}			}

	void DAGTypeLegalizer::SplitRes_UNDEF(SDNode *N, SDValue &Lo, SDValue &Hi) {			void DAGTypeLegalizer::SplitRes_UNDEF(SDNode *N, SDValue &Lo, SDValue &Hi) {
	EVT LoVT, HiVT;			EVT LoVT, HiVT;
	std::tie(LoVT, HiVT) = DAG.GetSplitDestVTs(N->getValueType(0));			std::tie(LoVT, HiVT) = DAG.GetSplitDestVTs(N->getValueType(0));
	Lo = DAG.getUNDEF(LoVT);			Lo = DAG.getUNDEF(LoVT);
	Hi = DAG.getUNDEF(HiVT);			Hi = DAG.getUNDEF(HiVT);
	}			}

				void DAGTypeLegalizer::SplitRes_FREEZE(SDNode *N, SDValue &Lo, SDValue &Hi) {
				SDValue L, H;
				SDLoc dl(N);
				GetSplitOp(N->getOperand(0), L, H);

				Lo = DAG.getNode(ISD::FREEZE, dl, L.getValueType(), L);
				Hi = DAG.getNode(ISD::FREEZE, dl, H.getValueType(), H);
				}

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	#endif
case ISD::FEXP:		case ISD::FEXP:
case ISD::FEXP2:		case ISD::FEXP2:
case ISD::FFLOOR:		case ISD::FFLOOR:
case ISD::FLOG:		case ISD::FLOG:
case ISD::FLOG10:		case ISD::FLOG10:
case ISD::FLOG2:		case ISD::FLOG2:
case ISD::FNEARBYINT:		case ISD::FNEARBYINT:
case ISD::FNEG:		case ISD::FNEG:
		case ISD::FREEZE:
case ISD::FP_EXTEND:		case ISD::FP_EXTEND:
case ISD::FP_TO_SINT:		case ISD::FP_TO_SINT:
case ISD::FP_TO_UINT:		case ISD::FP_TO_UINT:
case ISD::FRINT:		case ISD::FRINT:
case ISD::FROUND:		case ISD::FROUND:
case ISD::FSIN:		case ISD::FSIN:
case ISD::FSQRT:		case ISD::FSQRT:
case ISD::FTRUNC:		case ISD::FTRUNC:
▲ Show 20 Lines • Show All 772 Lines • ▼ Show 20 Lines	#endif
case ISD::FEXP:		case ISD::FEXP:
case ISD::FEXP2:		case ISD::FEXP2:
case ISD::FFLOOR:		case ISD::FFLOOR:
case ISD::FLOG:		case ISD::FLOG:
case ISD::FLOG10:		case ISD::FLOG10:
case ISD::FLOG2:		case ISD::FLOG2:
case ISD::FNEARBYINT:		case ISD::FNEARBYINT:
case ISD::FNEG:		case ISD::FNEG:
		case ISD::FREEZE:
case ISD::FP_EXTEND:		case ISD::FP_EXTEND:
case ISD::FP_ROUND:		case ISD::FP_ROUND:
case ISD::FP_TO_SINT:		case ISD::FP_TO_SINT:
case ISD::FP_TO_UINT:		case ISD::FP_TO_UINT:
case ISD::FRINT:		case ISD::FRINT:
case ISD::FROUND:		case ISD::FROUND:
case ISD::FSIN:		case ISD::FSIN:
case ISD::FSQRT:		case ISD::FSQRT:
▲ Show 20 Lines • Show All 1,939 Lines • ▼ Show 20 Lines	#include "llvm/IR/ConstrainedOps.def"
case ISD::BITREVERSE:		case ISD::BITREVERSE:
case ISD::BSWAP:		case ISD::BSWAP:
case ISD::CTLZ:		case ISD::CTLZ:
case ISD::CTLZ_ZERO_UNDEF:		case ISD::CTLZ_ZERO_UNDEF:
case ISD::CTPOP:		case ISD::CTPOP:
case ISD::CTTZ:		case ISD::CTTZ:
case ISD::CTTZ_ZERO_UNDEF:		case ISD::CTTZ_ZERO_UNDEF:
case ISD::FNEG:		case ISD::FNEG:
		case ISD::FREEZE:
case ISD::FCANONICALIZE:		case ISD::FCANONICALIZE:
Res = WidenVecRes_Unary(N);		Res = WidenVecRes_Unary(N);
break;		break;
case ISD::FMA:		case ISD::FMA:
Res = WidenVecRes_Ternary(N);		Res = WidenVecRes_Ternary(N);
break;		break;
}		}

▲ Show 20 Lines • Show All 2,337 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,095 Lines • ▼ Show 20 Lines	for (const auto *U : User->users()) {
// There is only one user of this ShuffleVector instruction, which		// There is only one user of this ShuffleVector instruction, which
// must be a reduction operation.		// must be a reduction operation.
if (!U->hasOneUse())		if (!U->hasOneUse())
return false;		return false;

auto U2 = dyn_cast<Instruction>(*U->user_begin());		auto U2 = dyn_cast<Instruction>(*U->user_begin());
if (!U2 \|\| U2->getOpcode() != OpCode)		if (!U2 \|\| U2->getOpcode() != OpCode)
return false;		return false;

craig.topperUnsubmitted Done Reply Inline Actions Drop this change? craig.topper: Drop this change?
// Check operands of the reduction operation.		// Check operands of the reduction operation.
if ((U2->getOperand(0) == U->getOperand(0) && U2->getOperand(1) == U) \|\|		if ((U2->getOperand(0) == U->getOperand(0) && U2->getOperand(1) == U) \|\|
		craig.topperUnsubmitted Done Reply Inline Actions Separate this into just visitFreeze. craig.topper: Separate this into just visitFreeze.
(U2->getOperand(1) == U->getOperand(0) && U2->getOperand(0) == U)) {		(U2->getOperand(1) == U->getOperand(0) && U2->getOperand(0) == U)) {
UsersToVisit.push_back(U2);		UsersToVisit.push_back(U2);
ElemNumToReduce /= 2;		ElemNumToReduce /= 2;
} else		} else
return false;		return false;
} else if (isa<ExtractElementInst>(U)) {		} else if (isa<ExtractElementInst>(U)) {
// At this moment we should have reduced all elements in the vector.		// At this moment we should have reduced all elements in the vector.
if (ElemNumToReduce != 1)		if (ElemNumToReduce != 1)
▲ Show 20 Lines • Show All 7,492 Lines • ▼ Show 20 Lines	if (NumClusters > 3 && TM.getOptLevel() != CodeGenOpt::None &&
continue;		continue;
}		}

lowerWorkItem(W, SI.getCondition(), SwitchMBB, DefaultMBB);		lowerWorkItem(W, SI.getCondition(), SwitchMBB, DefaultMBB);
}		}
}		}

void SelectionDAGBuilder::visitFreeze(const FreezeInst &I) {		void SelectionDAGBuilder::visitFreeze(const FreezeInst &I) {
SDValue N = getValue(I.getOperand(0));		SDNodeFlags Flags;
setValue(&I, N);
		SDValue Op = getValue(I.getOperand(0));
		if (I.getOperand(0)->getType()->isAggregateType()) {
		EVT VT = Op.getValueType();
		SmallVector<SDValue, 1> Values;
		for (unsigned i = 0; i < Op.getNumOperands(); ++i) {
		SDValue Arg(Op.getNode(), i);
		SDValue UnNodeValue = DAG.getNode(ISD::FREEZE, getCurSDLoc(), VT, Arg, Flags);
		Values.push_back(UnNodeValue);
		}
		SDValue MergedValue = DAG.getMergeValues(Values, getCurSDLoc());
		setValue(&I, MergedValue);
		} else {
		SDValue UnNodeValue = DAG.getNode(ISD::FREEZE, getCurSDLoc(), Op.getValueType(),
		Op, Flags);
		setValue(&I, UnNodeValue);
		}
}		}

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 381 Lines • ▼ Show 20 Lines	#endif
case ISD::STACKRESTORE: return "stackrestore";		case ISD::STACKRESTORE: return "stackrestore";
case ISD::TRAP: return "trap";		case ISD::TRAP: return "trap";
case ISD::DEBUGTRAP: return "debugtrap";		case ISD::DEBUGTRAP: return "debugtrap";
case ISD::LIFETIME_START: return "lifetime.start";		case ISD::LIFETIME_START: return "lifetime.start";
case ISD::LIFETIME_END: return "lifetime.end";		case ISD::LIFETIME_END: return "lifetime.end";
case ISD::GC_TRANSITION_START: return "gc_transition.start";		case ISD::GC_TRANSITION_START: return "gc_transition.start";
case ISD::GC_TRANSITION_END: return "gc_transition.end";		case ISD::GC_TRANSITION_END: return "gc_transition.end";
case ISD::GET_DYNAMIC_AREA_OFFSET: return "get.dynamic.area.offset";		case ISD::GET_DYNAMIC_AREA_OFFSET: return "get.dynamic.area.offset";
		case ISD::FREEZE: return "freeze";

// Bit manipulation		// Bit manipulation
case ISD::ABS: return "abs";		case ISD::ABS: return "abs";
case ISD::BITREVERSE: return "bitreverse";		case ISD::BITREVERSE: return "bitreverse";
case ISD::BSWAP: return "bswap";		case ISD::BSWAP: return "bswap";
case ISD::CTPOP: return "ctpop";		case ISD::CTPOP: return "ctpop";
case ISD::CTTZ: return "cttz";		case ISD::CTTZ: return "cttz";
case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";		case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";
▲ Show 20 Lines • Show All 586 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

Show First 20 Lines • Show All 2,279 Lines • ▼ Show 20 Lines	void SelectionDAGISel::Select_WRITE_REGISTER(SDNode *Op) {
ReplaceUses(Op, New.getNode());		ReplaceUses(Op, New.getNode());
CurDAG->RemoveDeadNode(Op);		CurDAG->RemoveDeadNode(Op);
}		}

void SelectionDAGISel::Select_UNDEF(SDNode *N) {		void SelectionDAGISel::Select_UNDEF(SDNode *N) {
CurDAG->SelectNodeTo(N, TargetOpcode::IMPLICIT_DEF, N->getValueType(0));		CurDAG->SelectNodeTo(N, TargetOpcode::IMPLICIT_DEF, N->getValueType(0));
}		}

		void SelectionDAGISel::Select_FREEZE(SDNode *N) {
		CurDAG->SelectNodeTo(N, TargetOpcode::FREEZE, N->getValueType(0),
		N->getOperand(0));
		}

/// GetVBR - decode a vbr encoding whose top bit is set.		/// GetVBR - decode a vbr encoding whose top bit is set.
LLVM_ATTRIBUTE_ALWAYS_INLINE static inline uint64_t		LLVM_ATTRIBUTE_ALWAYS_INLINE static inline uint64_t
GetVBR(uint64_t Val, const unsigned char *MatcherTable, unsigned &Idx) {		GetVBR(uint64_t Val, const unsigned char *MatcherTable, unsigned &Idx) {
assert(Val >= 128 && "Not a VBR");		assert(Val >= 128 && "Not a VBR");
Val &= 127; // Remove first vbr bit.		Val &= 127; // Remove first vbr bit.

unsigned Shift = 7;		unsigned Shift = 7;
uint64_t NextBits;		uint64_t NextBits;
▲ Show 20 Lines • Show All 520 Lines • ▼ Show 20 Lines	case ISD::READ_REGISTER:
Select_READ_REGISTER(NodeToMatch);		Select_READ_REGISTER(NodeToMatch);
return;		return;
case ISD::WRITE_REGISTER:		case ISD::WRITE_REGISTER:
Select_WRITE_REGISTER(NodeToMatch);		Select_WRITE_REGISTER(NodeToMatch);
return;		return;
case ISD::UNDEF:		case ISD::UNDEF:
Select_UNDEF(NodeToMatch);		Select_UNDEF(NodeToMatch);
return;		return;
		case ISD::FREEZE:
		Select_FREEZE(NodeToMatch);
		return;
}		}

assert(!NodeToMatch->isMachineOpcode() && "Node already selected!");		assert(!NodeToMatch->isMachineOpcode() && "Node already selected!");

// Set up the node stack with NodeToMatch as the only node on the stack.		// Set up the node stack with NodeToMatch as the only node on the stack.
SmallVector<SDValue, 8> NodeStack;		SmallVector<SDValue, 8> NodeStack;
SDValue N = SDValue(NodeToMatch, 0);		SDValue N = SDValue(NodeToMatch, 0);
NodeStack.push_back(N);		NodeStack.push_back(N);
▲ Show 20 Lines • Show All 899 Lines • Show Last 20 Lines

llvm/lib/CodeGen/TargetLoweringBase.cpp

Show First 20 Lines • Show All 1,639 Lines • ▼ Show 20 Lines	#include "llvm/IR/Instruction.def"
case UserOp2: return 0;		case UserOp2: return 0;
case VAArg: return 0;		case VAArg: return 0;
case ExtractElement: return ISD::EXTRACT_VECTOR_ELT;		case ExtractElement: return ISD::EXTRACT_VECTOR_ELT;
case InsertElement: return ISD::INSERT_VECTOR_ELT;		case InsertElement: return ISD::INSERT_VECTOR_ELT;
case ShuffleVector: return ISD::VECTOR_SHUFFLE;		case ShuffleVector: return ISD::VECTOR_SHUFFLE;
case ExtractValue: return ISD::MERGE_VALUES;		case ExtractValue: return ISD::MERGE_VALUES;
case InsertValue: return ISD::MERGE_VALUES;		case InsertValue: return ISD::MERGE_VALUES;
case LandingPad: return 0;		case LandingPad: return 0;
case Freeze: return 0;		case Freeze: return ISD::FREEZE;
}		}

llvm_unreachable("Unknown instruction type encountered!");		llvm_unreachable("Unknown instruction type encountered!");
}		}

std::pair<int, MVT>		std::pair<int, MVT>
TargetLoweringBase::getTypeLegalizationCost(const DataLayout &DL,		TargetLoweringBase::getTypeLegalizationCost(const DataLayout &DL,
Type *Ty) const {		Type *Ty) const {
▲ Show 20 Lines • Show All 407 Lines • Show Last 20 Lines

llvm/lib/CodeGen/TargetPassConfig.cpp

Show First 20 Lines • Show All 921 Lines • ▼ Show 20 Lines	void TargetPassConfig::addMachinePasses() {
if (!isPassSubstitutedOrOverridden(&PrologEpilogCodeInserterID))		if (!isPassSubstitutedOrOverridden(&PrologEpilogCodeInserterID))
addPass(createPrologEpilogInserterPass());		addPass(createPrologEpilogInserterPass());

/// Add passes that optimize machine instructions after register allocation.		/// Add passes that optimize machine instructions after register allocation.
if (getOptLevel() != CodeGenOpt::None)		if (getOptLevel() != CodeGenOpt::None)
addMachineLateOptimization();		addMachineLateOptimization();

// Expand pseudo instructions before second scheduling pass.		// Expand pseudo instructions before second scheduling pass.
		// After this pass, IMPLICIT_DEF cannot yield different values per use.
		// For example, following transformation is not valid anymore:
		// eax = IMPLICIT_DEF
		// use(eax)
		// use(eax)
		// =>
		// eax = IMPLICIT_DEF
		// use(eax)
		// ebx = IMPLICIT_DEF
		// use(ebx)
addPass(&ExpandPostRAPseudosID);		addPass(&ExpandPostRAPseudosID);

// Run pre-sched2 passes.		// Run pre-sched2 passes.
addPreSched2();		addPreSched2();

if (EnableImplicitNullChecks)		if (EnableImplicitNullChecks)
addPass(&ImplicitNullChecksID);		addPass(&ImplicitNullChecksID);

▲ Show 20 Lines • Show All 303 Lines • Show Last 20 Lines

llvm/lib/Target/AVR/AVRInstrInfo.cpp

Show First 20 Lines • Show All 479 Lines • ▼ Show 20 Lines	unsigned AVRInstrInfo::getInstSizeInBytes(const MachineInstr &MI) const {
// A regular instruction		// A regular instruction
default: {		default: {
const MCInstrDesc &Desc = get(Opcode);		const MCInstrDesc &Desc = get(Opcode);
return Desc.getSize();		return Desc.getSize();
}		}
case TargetOpcode::EH_LABEL:		case TargetOpcode::EH_LABEL:
case TargetOpcode::IMPLICIT_DEF:		case TargetOpcode::IMPLICIT_DEF:
case TargetOpcode::KILL:		case TargetOpcode::KILL:
		case TargetOpcode::FREEZE:
case TargetOpcode::DBG_VALUE:		case TargetOpcode::DBG_VALUE:
return 0;		return 0;
case TargetOpcode::INLINEASM:		case TargetOpcode::INLINEASM:
case TargetOpcode::INLINEASM_BR: {		case TargetOpcode::INLINEASM_BR: {
const MachineFunction &MF = *MI.getParent()->getParent();		const MachineFunction &MF = *MI.getParent()->getParent();
const AVRTargetMachine &TM = static_cast<const AVRTargetMachine&>(MF.getTarget());		const AVRTargetMachine &TM = static_cast<const AVRTargetMachine&>(MF.getTarget());
const AVRSubtarget &STI = MF.getSubtarget<AVRSubtarget>();		const AVRSubtarget &STI = MF.getSubtarget<AVRSubtarget>();
const TargetInstrInfo &TII = *STI.getInstrInfo();		const TargetInstrInfo &TII = *STI.getInstrInfo();
▲ Show 20 Lines • Show All 79 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/fast-isel-freeze.ll

This file was added.

				; RUN: llc < %s -mtriple=x86_64-unknown-linux \| FileCheck %s --check-prefix=SDAG
				; RUN: llc < %s -fast-isel -fast-isel-abort=1 -mtriple=x86_64-unknown-linux \| FileCheck %s --check-prefix=FAST

				define i32 @freeze(i32 %t) {
				; SDAG: movl $10, %eax
				; SDAG-NEXT: xorl %ecx, %eax
				; SDAG-NEXT: retq
				; FAST: movl $10, %eax
				; FAST-NEXT: xorl %ecx, %eax
				; FAST-NEXT: retq
				%1 = freeze i32 %t
				%2 = freeze i32 10
				%3 = xor i32 %1, %2
				ret i32 %3
				}

llvm/test/CodeGen/X86/fast-isel.ll

	Show First 20 Lines • Show All 93 Lines • ▼ Show 20 Lines
	}			}

	define void @load_store_i1(i1* %p, i1* %q) nounwind {			define void @load_store_i1(i1* %p, i1* %q) nounwind {
	%t = load i1, i1* %p			%t = load i1, i1* %p
	store i1 %t, i1* %q			store i1 %t, i1* %q
	ret void			ret void
	}			}

				define void @freeze_i32(i32 %x) {
				%t = freeze i32 %x
				ret void
				}

	@crash_test1x = external global <2 x i32>, align 8			@crash_test1x = external global <2 x i32>, align 8

	define void @crash_test1() nounwind ssp {			define void @crash_test1() nounwind ssp {
	%tmp = load <2 x i32>, <2 x i32>* @crash_test1x, align 8			%tmp = load <2 x i32>, <2 x i32>* @crash_test1x, align 8
	%neg = xor <2 x i32> %tmp, <i32 -1, i32 -1>			%neg = xor <2 x i32> %tmp, <i32 -1, i32 -1>
	ret void			ret void
	}			}

	Show All 18 Lines

llvm/test/CodeGen/X86/freeze-call.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -mtriple=x86_64-unknown-linux-gnu < %s 2>&1 \| FileCheck %s --check-prefix=X86ASM
				; RUN: llc -mtriple=x86_64-unknown-linux-gnu -optimize-regalloc=false < %s 2>&1 \| FileCheck %s --check-prefix=X86ASM_NORAOPT

				declare i32 @g()

				define i32 @foo() nounwind {
				; X86ASM-LABEL: foo:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: pushq %rbx
				; X86ASM-NEXT: # kill: def $ebx killed $eax def $rbx
				; X86ASM-NEXT: callq g
				; X86ASM-NEXT: leal 30(%rbx,%rbx), %eax
				lebedev.riUnsubmitted Done Reply Inline Actions You can avoid cfi noise via `define i32 @foo() nounwind {` lebedev.ri: You can avoid cfi noise via `define i32 @foo() nounwind {`
				; X86ASM-NEXT: popq %rbx
				; X86ASM-NEXT: retq
				;
				; X86ASM_NORAOPT-LABEL: foo:
				; X86ASM_NORAOPT: # %bb.0:
				; X86ASM_NORAOPT-NEXT: pushq %rax
				; X86ASM_NORAOPT-NEXT: # implicit-def: $eax
				; X86ASM_NORAOPT-NEXT: movl %eax, {{[-0-9]+}}(%r{{[sb]}}p) # 4-byte Spill
				; X86ASM_NORAOPT-NEXT: callq g
				; X86ASM_NORAOPT-NEXT: # implicit-def: $rcx
				; X86ASM_NORAOPT-NEXT: movl {{[-0-9]+}}(%r{{[sb]}}p), %edx # 4-byte Reload
				; X86ASM_NORAOPT-NEXT: movl %edx, %ecx
				; X86ASM_NORAOPT-NEXT: leal 30(%rcx,%rcx), %eax
				; X86ASM_NORAOPT-NEXT: popq %rcx
				; X86ASM_NORAOPT-NEXT: retq
				%y1 = freeze i32 undef
				%k = add i32 %y1, 10
				call i32 @g()
				%k2 = add i32 %y1, 20
				%res = add i32 %k, %k2
				ret i32 %res
				}

llvm/test/CodeGen/X86/freeze-legalize.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; Make sure that seldag legalization works correctly for freeze instruction.
				; RUN: llc -mtriple=i386-apple-darwin < %s 2>&1 \| FileCheck %s

				define i64 @expand(i32 %x) {
				; CHECK-LABEL: expand:
				; CHECK: ## %bb.0:
				; CHECK-NEXT: movl $303174162, %eax ## imm = 0x12121212
				; CHECK-NEXT: movl $875836468, %ecx ## imm = 0x34343434
				; CHECK-NEXT: movl $1448498774, %edx ## imm = 0x56565656
				; CHECK-NEXT: xorl %eax, %edx
				; CHECK-NEXT: movl $2021161080, %eax ## imm = 0x78787878
				; CHECK-NEXT: xorl %ecx, %eax
				spatelUnsubmitted Not Done Reply Inline Actions Do we need to add basic simplify / constant folding for SDAG ? freeze ( Constant ) --> Constant spatel: Do we need to add basic simplify / constant folding for SDAG ? freeze ( Constant ) --> Constant
				aqjuneAuthorUnsubmitted Done Reply Inline Actions Yes, I agree it will be great. Do you want to make this patch contain the change as well? aqjune: Yes, I agree it will be great. Do you want to make this patch contain the change as well?
				aqjuneAuthorUnsubmitted Done Reply Inline Actions Or I can land this first, and add the simplify / constant folding for SDAG. I prefer incrementally making things because this patch itself is a big change. aqjune: Or I can land this first, and add the simplify / constant folding for SDAG. I prefer…
				spatelUnsubmitted Not Done Reply Inline Actions I prefer smaller patches too. Let's make that a follow-up. Is it correct that there is very little chance that this patch will create a visible performance regression (because there should be almost no freeze instruction creation in IR yet)? spatel: I prefer smaller patches too. Let's make that a follow-up. Is it correct that there is very…
				aqjuneAuthorUnsubmitted Done Reply Inline Actions Yes, it is. Currently there is only one place where freeze is introduced - https://reviews.llvm.org/D76179 I checked that from assembly outputs of LLVM test-suite , only 3 / 5239 files are affected by this patch. aqjune: Yes, it is. Currently there is only one place where freeze is introduced - https://reviews.llvm.
				spatelUnsubmitted Not Done Reply Inline Actions That sounds good then. But can we avoid those 3 regressions cases before or within this patch? Ideally, we don't want to knowingly regress anything. spatel: That sounds good then. But can we avoid those 3 regressions cases before or within this patch?
				aqjuneAuthorUnsubmitted Done Reply Inline Actions Among 3 files, two were simple regressions that had a bit more verbose assembly: ... sete %al orb %bpl, %al jne .LBB9_1 => ... sete %al orb %bpl, %al testb $1, %al jne .LBB9_1 ... testb $1, %al je .LBB0_2 => ... movl %eax, %ecx andl $1, %ecx je .LBB0_2 Case 3's assembly diff was bigger, so needs inspection. I can visit it after the simpler two cases are resolved. aqjune: Among 3 files, two were simple regressions that had a bit more verbose assembly: ``` ... sete…
				; CHECK-NEXT: retl
				%y1 = freeze i64 1302123111658042420 ; 0x1212121234343434
				%y2 = freeze i64 6221254864647256184 ; 0x5656565678787878
				%t2 = xor i64 %y1, %y2
				ret i64 %t2
				}


				define <2 x i64> @expand_vec(i32 %x) nounwind {
				; CHECK-LABEL: expand_vec:
				; CHECK: ## %bb.0:
				; CHECK-NEXT: pushl %ebx
				; CHECK-NEXT: pushl %edi
				; CHECK-NEXT: pushl %esi
				; CHECK-NEXT: movl {{[0-9]+}}(%esp), %eax
				; CHECK-NEXT: movl $16843009, %ecx ## imm = 0x1010101
				; CHECK-NEXT: movl $589505315, %edx ## imm = 0x23232323
				; CHECK-NEXT: movl $303174162, %esi ## imm = 0x12121212
				; CHECK-NEXT: movl $875836468, %edi ## imm = 0x34343434
				; CHECK-NEXT: movl $1162167621, %ebx ## imm = 0x45454545
				; CHECK-NEXT: xorl %ecx, %ebx
				; CHECK-NEXT: movl $1734829927, %ecx ## imm = 0x67676767
				; CHECK-NEXT: xorl %edx, %ecx
				; CHECK-NEXT: movl $1448498774, %edx ## imm = 0x56565656
				; CHECK-NEXT: xorl %esi, %edx
				; CHECK-NEXT: movl $2021161080, %esi ## imm = 0x78787878
				; CHECK-NEXT: xorl %edi, %esi
				; CHECK-NEXT: movl %ebx, 12(%eax)
				; CHECK-NEXT: movl %ecx, 8(%eax)
				; CHECK-NEXT: movl %edx, 4(%eax)
				; CHECK-NEXT: movl %esi, (%eax)
				; CHECK-NEXT: popl %esi
				; CHECK-NEXT: popl %edi
				; CHECK-NEXT: popl %ebx
				; CHECK-NEXT: retl $4
				; <0x1212121234343434, 0x101010123232323>
				%y1 = freeze <2 x i64> <i64 1302123111658042420, i64 72340173410738979>
				; <0x5656565678787878, 0x4545454567676767>
				%y2 = freeze <2 x i64> <i64 6221254864647256184, i64 4991471926399952743>
				%t2 = xor <2 x i64> %y1, %y2
				ret <2 x i64> %t2
				}

				define i10 @promote() {
				; CHECK-LABEL: promote:
				; CHECK: ## %bb.0:
				; CHECK-NEXT: movw $682, %ax ## imm = 0x2AA
				; CHECK-NEXT: movl %eax, %ecx
				; CHECK-NEXT: movw $992, %ax ## imm = 0x3E0
				; CHECK-NEXT: ## kill: def $ax killed $ax def $eax
				; CHECK-NEXT: addl %ecx, %eax
				; CHECK-NEXT: ## kill: def $ax killed $ax killed $eax
				; CHECK-NEXT: retl
				%a = freeze i10 682
				%b = freeze i10 992
				%res = add i10 %a, %b
				ret i10 %res
				}

				define <2 x i10> @promote_vec() {
				; CHECK-LABEL: promote_vec:
				; CHECK: ## %bb.0:
				; CHECK-NEXT: movw $125, %ax
				; CHECK-NEXT: ## kill: def $ax killed $ax def $eax
				; CHECK-NEXT: movw $682, %cx ## imm = 0x2AA
				; CHECK-NEXT: ## kill: def $cx killed $cx def $ecx
				; CHECK-NEXT: movw $393, %dx ## imm = 0x189
				; CHECK-NEXT: ## kill: def $dx killed $dx def $edx
				; CHECK-NEXT: addl %eax, %edx
				; CHECK-NEXT: movw $992, %ax ## imm = 0x3E0
				; CHECK-NEXT: ## kill: def $ax killed $ax def $eax
				; CHECK-NEXT: addl %ecx, %eax
				; CHECK-NEXT: ## kill: def $ax killed $ax killed $eax
				; CHECK-NEXT: ## kill: def $dx killed $dx killed $edx
				; CHECK-NEXT: retl
				%a = freeze <2 x i10> <i10 682, i10 125>
				%b = freeze <2 x i10> <i10 992, i10 393>
				%res = add <2 x i10> %a, %b
				ret <2 x i10> %res
				}

llvm/test/CodeGen/X86/freeze-mir.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				; RUN: llc -mtriple=x86_64-unknown-linux-gnu -verify-machineinstrs -o - -stop-after=finalize-isel %s 2>&1 \| FileCheck %s --check-prefix=X86MIR
				aqjuneAuthorUnsubmitted Done Reply Inline Actions I made a separate `freeze-mir.ll` file for mir test because `update_mir_test_checks.py` did not work after `freeze.ll` is processed with `update_llc_test_checks.py` aqjune: I made a separate `freeze-mir.ll` file for mir test because `update_mir_test_checks.py` did not…

				%struct.T = type { i32, i32 }

				define i32 @freeze_int() {
				; X86MIR-LABEL: name: freeze_int
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:gr32 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:gr32 = FREEZE killed [[DEF]]
				; X86MIR: [[IMUL32rr:%[0-9]+]]:gr32 = IMUL32rr [[FREEZE]], [[FREEZE]], implicit-def dead $eflags
				; X86MIR: [[XOR32rr:%[0-9]+]]:gr32 = XOR32rr [[IMUL32rr]], [[FREEZE]], implicit-def dead $eflags
				; X86MIR: $eax = COPY [[XOR32rr]]
				; X86MIR: RET 0, $eax
				%y1 = freeze i32 undef
				%t1 = mul i32 %y1, %y1
				%t2 = xor i32 %t1, %y1
				ret i32 %t2
				}

				define i5 @freeze_int2() {
				; X86MIR-LABEL: name: freeze_int2
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:gr8 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:gr8 = FREEZE killed [[DEF]]
				; X86MIR: $al = COPY [[FREEZE]]
				; X86MIR: MUL8r [[FREEZE]], implicit-def $al, implicit-def dead $eflags, implicit-def $ax, implicit $al
				; X86MIR: [[COPY:%[0-9]+]]:gr8 = COPY $al
				; X86MIR: [[XOR8rr:%[0-9]+]]:gr8 = XOR8rr [[COPY]], [[FREEZE]], implicit-def dead $eflags
				; X86MIR: $al = COPY [[XOR8rr]]
				; X86MIR: RET 0, $al
				%y1 = freeze i5 undef
				%t1 = mul i5 %y1, %y1
				%t2 = xor i5 %t1, %y1
				ret i5 %t2
				}

				define float @freeze_float() {
				; X86MIR-LABEL: name: freeze_float
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:fr32 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:fr32 = FREEZE killed [[DEF]]
				; X86MIR: %2:fr32 = nofpexcept ADDSSrr [[FREEZE]], [[FREEZE]], implicit $mxcsr
				; X86MIR: $xmm0 = COPY %2
				; X86MIR: RET 0, $xmm0
				%y1 = freeze float undef
				%t1 = fadd float %y1, %y1
				ret float %t1
				}

				define <2 x i32> @freeze_ivec() {
				; X86MIR-LABEL: name: freeze_ivec
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:vr128 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:vr128 = FREEZE killed [[DEF]]
				; X86MIR: [[PADDDrr:%[0-9]+]]:vr128 = PADDDrr [[FREEZE]], [[FREEZE]]
				; X86MIR: $xmm0 = COPY [[PADDDrr]]
				; X86MIR: RET 0, $xmm0
				%y1 = freeze <2 x i32> undef
				%t1 = add <2 x i32> %y1, %y1
				ret <2 x i32> %t1
				}

				define i8* @freeze_ptr() {
				; X86MIR-LABEL: name: freeze_ptr
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:gr64 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:gr64 = FREEZE killed [[DEF]]
				; X86MIR: [[ADD64ri8_:%[0-9]+]]:gr64 = ADD64ri8 [[FREEZE]], 4, implicit-def dead $eflags
				; X86MIR: $rax = COPY [[ADD64ri8_]]
				; X86MIR: RET 0, $rax
				%y1 = freeze i8* undef
				%t1 = getelementptr i8, i8* %y1, i64 4
				ret i8* %t1
				}

				define i32 @freeze_struct() {
				; X86MIR-LABEL: name: freeze_struct
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:gr32 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:gr32 = FREEZE killed [[DEF]]
				; X86MIR: [[ADD32rr:%[0-9]+]]:gr32 = ADD32rr [[FREEZE]], [[FREEZE]], implicit-def dead $eflags
				; X86MIR: $eax = COPY [[ADD32rr]]
				; X86MIR: RET 0, $eax
				%y1 = freeze %struct.T undef
				%v1 = extractvalue %struct.T %y1, 0
				%v2 = extractvalue %struct.T %y1, 1
				%t1 = add i32 %v1, %v2
				ret i32 %t1
				}

				define i32 @freeze_anonstruct() {
				; X86MIR-LABEL: name: freeze_anonstruct
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:gr32 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:gr32 = FREEZE killed [[DEF]]
				; X86MIR: [[ADD32rr:%[0-9]+]]:gr32 = ADD32rr [[FREEZE]], [[FREEZE]], implicit-def dead $eflags
				; X86MIR: $eax = COPY [[ADD32rr]]
				; X86MIR: RET 0, $eax
				%y1 = freeze {i32, i32} undef
				%v1 = extractvalue {i32, i32} %y1, 0
				%v2 = extractvalue {i32, i32} %y1, 1
				%t1 = add i32 %v1, %v2
				ret i32 %t1
				}

				define i64 @freeze_array() {
				; X86MIR-LABEL: name: freeze_array
				; X86MIR: bb.0 (%ir-block.0):
				; X86MIR: [[DEF:%[0-9]+]]:gr64 = IMPLICIT_DEF
				; X86MIR: [[FREEZE:%[0-9]+]]:gr64 = FREEZE killed [[DEF]]
				; X86MIR: [[ADD64rr:%[0-9]+]]:gr64 = ADD64rr [[FREEZE]], [[FREEZE]], implicit-def dead $eflags
				; X86MIR: $rax = COPY [[ADD64rr]]
				; X86MIR: RET 0, $rax
				%y1 = freeze [2 x i64] undef
				%v1 = extractvalue [2 x i64] %y1, 0
				%v2 = extractvalue [2 x i64] %y1, 1
				%t1 = add i64 %v1, %v2
				ret i64 %t1
				}

llvm/test/CodeGen/X86/freeze-phielim.ll

This file was added.

				; RUN: llc -mtriple=x86_64-unknown-linux-gnu < %s 2>&1 \| FileCheck %s --check-prefix=X86ASM
				; RUN: llc -mtriple=x86_64-unknown-linux-gnu -optimize-regalloc=false < %s 2>&1 \| FileCheck %s --check-prefix=X86ASM_NORAOPT

				@x = global i32 0
				@y = global i32 0

				define void @f(i1 %cond) {
				; X86ASM-LABEL: f:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $eax killed $eax def $rax
				; X86ASM-NEXT: testb $1, %dil
				; X86ASM-NEXT: je .LBB0_2
				; X86ASM-NEXT: # %bb.1: # %BB1
				; X86ASM-NEXT: leal -1(%rax), %ecx
				; X86ASM-NEXT: jmp .LBB0_3
				; X86ASM-NEXT: .LBB0_2: # %BB2
				; X86ASM-NEXT: xorl %ecx, %ecx
				; X86ASM-NEXT: .LBB0_3: # %END
				; X86ASM-NEXT: movl %eax, {{.*}}(%rip)
				; X86ASM-NEXT: movl %ecx, {{.*}}(%rip)
				; X86ASM-NEXT: retq
				;
				; X86ASM_NORAOPT-LABEL: f:
				; X86ASM_NORAOPT: # %bb.0:
				; X86ASM_NORAOPT-NEXT: # kill: def $dil killed $dil killed $edi
				; X86ASM_NORAOPT-NEXT: # implicit-def: $eax
				; X86ASM_NORAOPT-NEXT: testb $1, %dil
				; X86ASM_NORAOPT-NEXT: je .LBB0_2
				; X86ASM_NORAOPT-NEXT: # %bb.1:
				; X86ASM_NORAOPT-NEXT: movl %eax, %ecx
				; X86ASM_NORAOPT-NEXT: leal -1(%rcx), %edx
				; X86ASM_NORAOPT-NEXT: movl %eax, -4(%rsp)
				; X86ASM_NORAOPT-NEXT: movl %edx, -8(%rsp)
				; X86ASM_NORAOPT-NEXT: jmp .LBB0_3
				; X86ASM_NORAOPT-NEXT:.LBB0_2:
				; X86ASM_NORAOPT-NEXT: xorl %ecx, %ecx
				; X86ASM_NORAOPT-NEXT: movl %eax, -4(%rsp)
				; X86ASM_NORAOPT-NEXT: movl %ecx, -8(%rsp)
				; X86ASM_NORAOPT-NEXT:.LBB0_3:
				; X86ASM_NORAOPT-NEXT: movl -8(%rsp), %eax
				; X86ASM_NORAOPT-NEXT: movl -4(%rsp), %ecx
				; X86ASM_NORAOPT-NEXT: movl %ecx, x(%rip)
				; X86ASM_NORAOPT-NEXT: movl %eax, y(%rip)
				; X86ASM_NORAOPT-NEXT: retq
				br i1 %cond, label %BB1, label %BB2
				BB1:
				%y1 = freeze i32 undef
				%k1 = sub i32 %y1, 1
				br label %END
				BB2:
				%y2 = freeze i32 undef
				br label %END
				END:
				%p = phi i32 [%y1, %BB1], [%y2, %BB2]
				%p2 = phi i32 [%k1, %BB1], [0, %BB2]
				store i32 %p, i32* @x
				store i32 %p2, i32* @y
				ret void
				}

llvm/test/CodeGen/X86/freeze.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -mtriple=x86_64-unknown-linux-gnu < %s 2>&1 \| FileCheck %s --check-prefix=X86ASM

				%struct.T = type { i32, i32 }

				define i32 @freeze_int() {
				; X86ASM-LABEL: freeze_int:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $ecx killed $eax
				; X86ASM-NEXT: movl %ecx, %eax
				; X86ASM-NEXT: imull %ecx, %eax
				; X86ASM-NEXT: xorl %ecx, %eax
				; X86ASM-NEXT: retq
				%y1 = freeze i32 undef
				%t1 = mul i32 %y1, %y1
				%t2 = xor i32 %t1, %y1
				ret i32 %t2
				}

				define i5 @freeze_int2() {
				; X86ASM-LABEL: freeze_int2:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $cl killed $al
				; X86ASM-NEXT: movl %ecx, %eax
				; X86ASM-NEXT: mulb %cl
				; X86ASM-NEXT: xorb %cl, %al
				; X86ASM-NEXT: retq
				%y1 = freeze i5 undef
				%t1 = mul i5 %y1, %y1
				%t2 = xor i5 %t1, %y1
				ret i5 %t2
				}

				define float @freeze_float() {
				; X86ASM-LABEL: freeze_float:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $xmm0 killed $xmm0
				; X86ASM-NEXT: addss %xmm0, %xmm0
				; X86ASM-NEXT: retq
				%y1 = freeze float undef
				%t1 = fadd float %y1, %y1
				ret float %t1
				}

				define <2 x i32> @freeze_ivec() {
				; X86ASM-LABEL: freeze_ivec:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $xmm0 killed $xmm0
				; X86ASM-NEXT: paddd %xmm0, %xmm0
				; X86ASM-NEXT: retq
				%y1 = freeze <2 x i32> undef
				%t1 = add <2 x i32> %y1, %y1
				ret <2 x i32> %t1
				}

				define i8* @freeze_ptr() {
				; X86ASM-LABEL: freeze_ptr:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $rax killed $rax
				; X86ASM-NEXT: addq $4, %rax
				; X86ASM-NEXT: retq
				%y1 = freeze i8* undef
				%t1 = getelementptr i8, i8* %y1, i64 4
				ret i8* %t1
				}

				define i32 @freeze_struct() {
				; X86ASM-LABEL: freeze_struct:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $eax killed $eax
				; X86ASM-NEXT: addl %eax, %eax
				; X86ASM-NEXT: retq
				%y1 = freeze %struct.T undef
				%v1 = extractvalue %struct.T %y1, 0
				%v2 = extractvalue %struct.T %y1, 1
				%t1 = add i32 %v1, %v2
				ret i32 %t1
				}

				define i32 @freeze_anonstruct() {
				; X86ASM-LABEL: freeze_anonstruct:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $eax killed $eax
				; X86ASM-NEXT: addl %eax, %eax
				; X86ASM-NEXT: retq
				%y1 = freeze {i32, i32} undef
				%v1 = extractvalue {i32, i32} %y1, 0
				%v2 = extractvalue {i32, i32} %y1, 1
				%t1 = add i32 %v1, %v2
				ret i32 %t1
				}

				define i64 @freeze_array() {
				; X86ASM-LABEL: freeze_array:
				; X86ASM: # %bb.0:
				; X86ASM-NEXT: # kill: def $rax killed $rax
				; X86ASM-NEXT: addq %rax, %rax
				; X86ASM-NEXT: retq
				%y1 = freeze [2 x i64] undef
				%v1 = extractvalue [2 x i64] %y1, 0
				%v2 = extractvalue [2 x i64] %y1, 1
				%t1 = add i64 %v1, %v2
				ret i64 %t1
				}

This is an archive of the discontinued LLVM Phabricator instance.

[SelDag] Add FREEZEClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 239013

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

llvm/include/llvm/CodeGen/FastISel.h

llvm/include/llvm/CodeGen/ISDOpcodes.h

llvm/include/llvm/CodeGen/SelectionDAGISel.h

llvm/include/llvm/Support/TargetOpcodes.def

llvm/include/llvm/Target/Target.td

llvm/lib/CodeGen/ExpandPostRAPseudos.cpp

llvm/lib/CodeGen/SelectionDAG/FastISel.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

llvm/lib/CodeGen/SelectionDAG/LegalizeTypesGeneric.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

llvm/lib/CodeGen/TargetLoweringBase.cpp

llvm/lib/CodeGen/TargetPassConfig.cpp

llvm/lib/Target/AVR/AVRInstrInfo.cpp

llvm/test/CodeGen/X86/fast-isel-freeze.ll

llvm/test/CodeGen/X86/fast-isel.ll

llvm/test/CodeGen/X86/freeze-call.ll

llvm/test/CodeGen/X86/freeze-legalize.ll

llvm/test/CodeGen/X86/freeze-mir.ll

llvm/test/CodeGen/X86/freeze-phielim.ll

llvm/test/CodeGen/X86/freeze.ll

[SelDag] Add FREEZE
ClosedPublic