This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/
-
IR/
1/1
Instruction.cpp
-
Transforms/InstCombine/
-
InstCombine/
5/5
InstCombineCalls.cpp
-
test/Transforms/
-
Transforms/
-
Coroutines/
8/8
coro-debug.ll
-
InstCombine/
-
freeze.ll
-
Reassociate/
-
callbr.ll

Differential D140166

[IR] return nullptr in Instruction::getInsertionPointAfterDef for CallBrInst
AbandonedPublic

Authored by nickdesaulniers on Dec 15 2022, 1:42 PM.

Download Raw Diff

Details

Reviewers

nikic
ChuanqiXu
StephenTozer
void
efriedma
MaskRay

Summary

A recommended in
https://reviews.llvm.org/D135997#3991427.

I will fold this in to D135997 if it is accepted in code review on phab.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	60,060 ms	x64 debian > ThreadSanitizer-x86_64.ThreadSanitizer-x86_64::restore_stack.cpp

Event Timeline

nickdesaulniers created this revision.Dec 15 2022, 1:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 15 2022, 1:42 PM

Herald added a subscriber: hiraditya. · View Herald Transcript

nickdesaulniers requested review of this revision.Dec 15 2022, 1:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 15 2022, 1:42 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B203459: Diff 483324.Dec 15 2022, 1:43 PM

nickdesaulniers added a parent revision: D135997: [Dominators] check indirect branches of callbr.Dec 15 2022, 1:43 PM

nickdesaulniers added reviewers: nikic, ChuanqiXu, StephenTozer.Dec 15 2022, 1:45 PM

nickdesaulniers added subscribers: efriedma, void, jyknight.

This looks fine, apart from the coro-debug.ll bit, that I'm not familiar with.

llvm/lib/IR/Instruction.cpp
142–144	Maybe comment on why? (Def is available in multiple successors, there's no single dominating insertion point.)

ChuanqiXu added inline comments.Dec 15 2022, 6:05 PM

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	We don't care about the inserted checks in the test. It should be fine to check the `llvm.dbg.declare` is in the basic block of `DEFAULT_DEST`. So maybe we can check these 2 lines are not empty or we can check there is no new BB declaration before `llvm.dbg.declare`.

Not very familiar with this area, but it looks as though currently InstCombinerImpl::transformConstExprCastCall in InstCombineCalls.cpp will need updating as well. Specifically it may transform a CallBr instruction and insert a Cast for the return, for which it calls NewCall->getInsertionPointAfterDef() and asserts that the return is non-null. Not sure what the solution would be, but it might require dropping support for CallBr in transformConstExprCastCall if the return type is changed: there's already a block in there that bails out of transforming invoke or callbr instructions if they are used in a PHI node, since there's no place to insert the Cast in that case.

add comment, fix InstCombinerImpl::transformConstExprCastCall

In D140166#4001785, @StephenTozer wrote:

Not very familiar with this area, but it looks as though currently InstCombinerImpl::transformConstExprCastCall in InstCombineCalls.cpp will need updating as well. Specifically it may transform a CallBr instruction and insert a Cast for the return, for which it calls NewCall->getInsertionPointAfterDef() and asserts that the return is non-null. Not sure what the solution would be, but it might require dropping support for CallBr in transformConstExprCastCall if the return type is changed: there's already a block in there that bails out of transforming invoke or callbr instructions if they are used in a PHI node, since there's no place to insert the Cast in that case.

Good catch! (I think we were ok since there's already a guard on the callee being a Function and not an InlineAsm and today we don't have frontends generate callbr to Functions; but we could, so I've cleaned that all up, PTAL).

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	I'm not sure how best to express that to FileCheck. `; CHECK-NEXT-NOT: {{.*}}:` ?

Harbormaster completed remote builds in B204244: Diff 484384.Dec 20 2022, 2:37 PM

ChuanqiXu added inline comments.Dec 20 2022, 5:44 PM

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	I feel it is a good way to check there is no new BB declaration before `llvm.dbg.declare`

nickdesaulniers mentioned this in D135997: [Dominators] check indirect branches of callbr.Dec 21 2022, 2:27 PM

nickdesaulniers mentioned this in D139872: [llvm][CallBrPrepare] split critical edges.

nickdesaulniers mentioned this in D139883: [llvm][CallBrPrepare] add llvm.callbr.landingpad intrinsic.

nickdesaulniers mentioned this in D139970: [llvm][CallBrPrepare] use SSAUpdater to use intrinsic value.

nickdesaulniers mentioned this in D140160: [llvm][SelectionDAGBuilder] codegen callbr.landingpad intrinsic.

nickdesaulniers mentioned this in D140180: [llvm] add CallBrPrepare pass to pipelines.

nickdesaulniers mentioned this in D136497: [Clang] support for outputs along indirect edges of asm goto.

MaskRay added a subscriber: MaskRay.Dec 21 2022, 2:31 PM

MaskRay added inline comments.

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	Actually I think `utils/update_test_checks.py` may not be a bad choice for this large test file. If it is not time to migrate the test, I think the patch as-is using `; CHECK-NEXT: %1 = load i8, i8* ; ...` looks good. It doesn't appear that there is more maintenance burden than `; CHECK-NEXT-NOT: {{.*}}:`

void accepted this revision.Dec 21 2022, 2:42 PM

This revision is now accepted and ready to land.Dec 21 2022, 2:42 PM

nickdesaulniers added inline comments.Dec 21 2022, 3:44 PM

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	Curiously, I tried removing existing CHECK lines from this test, then running `./llvm/utils/update_test_checks.py` on this. The result doesn't pass `llvm-lit -vv llvm/test/Transforms/Coroutines/coro-debug.ll` which may be why this test did not use `./llvm/utils/update_test_checks.py` in the first place.

ChuanqiXu added inline comments.Dec 21 2022, 6:29 PM

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	Personally I prefer to not use `utils/update_test_checks.py` in coroutine's tests. Since CoroSplit pass will generate many codes and it is pretty hard to read. Currently when I see the coroutine's tests, I can know the pattern of the interesting part easily instead of reading tons of `CHECK` (that's what I feel when I read the test generated by `utils/update_test_checks.py`). And for this patch, it is true that `; CHECK-NEXT: %1 = load i8, i8* ; ...` wouldn't matter a lot. But I still feel better to make the tests focus on the things it want to test.

nickdesaulniers added inline comments.Dec 22 2022, 11:24 AM

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	I never actually tested that `; CHECK-NEXT-NOT: {{.*}}:` would be viable. Why I try it: llvm-project/llvm/test/Transforms/Coroutines/coro-debug.ll:196:9: error: unsupported -NOT combo on prefix 'CHECK'

I agree that ; CHECK-NOT: {{.*}}: is uncommon and doesn't seem particularly useful in this case.
Actually, I think the test has a fair chance that it may bit rot and for maintainability we probably should a comment about the C++ source and some comments about what the test file test.

CC @dblaikie for the coro-debug.ll debate. The test was added from D33614.

we probably should a comment about the C++ source

Maybe there is not a C++ source for this test file. On the one side, the Coroutines intrinsics in LLVM is intended to be an extension for LLVM, which shouldn't be dependent on the frontend language. I mean, maybe it is not possible to find a C++ program which can generate the same or similar LLVM IR with the test in LLVM Coroutines tests. On the other side, you may be interested that why we don't test directly for the IR generated from a real C++ program. The answer from me is that it'll generate too many codes from a real C++ coroutines program due to the grammar of C++ (coroutines). Previously when I tried to use the IR generated from C++ directly, I found it is too big and complex so that I'll be confusing about the test (or in another word, I feel developers can't be focus on the test in that way).

So as a result, many tests in LLVM coroutines are written by hand instead of using the generated IR from a real C++ program.

and some comments about what the test file test.

For this test file, the things it test should be: the debug information for a coroutine function should be remained in its split functions. (Background: a coroutine will be split into pieces). And personally I feel it is already addressed in the top of the file.

Actually, I think the test has a fair chance that it may bit rot

We indeed lack some test for coroutines. But for the specific test, I feel it is good as far as I know.

llvm/test/Transforms/Coroutines/coro-debug.ll
196–198	I think we can refactor them into: ; CHECK: [[DEFAULT_DEST]]: ; CHECK-NOT: {{.*}}: ; CHECK: call void @llvm.dbg.declare(metadata i32 %[[CALLBR_RES]] Since currently there is no `NEXT` relationships.

In D140166#4013772, @MaskRay wrote:

I agree that ; CHECK-NOT: {{.*}}: is uncommon and doesn't seem particularly useful in this case.

I read it as "check that there's not another label between the previous checked label and the following instruction" - which seems OK to me? I might be inclined to firm it up a bit by adding {{$}} at the end to make it more clear that it's checking for a label and not a : that might appear anywhere else in a match line. (if it needs to be resilient to autogenerated comments after the label - yeah, more regexing would be OK, {{( ;.*)?$}} or whatever is necessary to make that work seems OK to me)

Actually, I think the test has a fair chance that it may bit rot and for maintainability we probably should a comment about the C++ source and some comments about what the test file test.

CC @dblaikie for the coro-debug.ll debate. The test was added from D33614.

I'm not sure I'm following all aspects of the discussion here - one is whether we should include the full C++ source and make the IR match the source, which will make it longer but maybe more explainable/maintainable in some ways (handcrafting a few IR instructions is one thing, but if it's so complicated/long we can't reasonably handcraft/modify it, then maybe it's better to generate it from clang instead). Especially for debug information, we tend to use frontend-generated IR, as simplified as possible. Though this test case might be a different situation - the IR is long, but not unmaintainably so, I think? How long's the frontend-generated IR (after as much IR optimization as is acceptable while still preserving the interesting thing to test?)?

How long's the frontend-generated IR (after as much IR optimization as is acceptable while still preserving the interesting thing to test?)?

For most tests under llvm/test/Transform/Coroutines, there is not a corresponding frontend source due to the design of coroutine will generate too many codes.

For example, for the following simple coroutine which returns 43 simply:

#include <cstdio>
#include <coroutine>
#include <exception>
#include <cassert>

template <typename T> struct task {
	struct promise_type {
		T value{123};
		std::coroutine_handle<> caller{std::noop_coroutine()};
		
		struct final_awaiter: std::suspend_always {
			auto await_suspend(std::coroutine_handle<promise_type> me) const noexcept {
				return me.promise().caller;
			}
		};
		
		constexpr auto initial_suspend() const noexcept {
			return std::suspend_always();
		}
		constexpr auto final_suspend() const noexcept {
			return final_awaiter{};
		}
		auto unhandled_exception() noexcept {
			// ignore
		}
		constexpr void return_value(T v) noexcept {
			value = v;
		} 
		constexpr auto & get_return_object() noexcept {
			return *this;
		}
	};
	
	using coroutine_handle = std::coroutine_handle<promise_type>;
	
	promise_type & promise{nullptr};
	
	task(promise_type & p) noexcept: promise{p} { }
	
	~task() noexcept {
		coroutine_handle::from_promise(promise).destroy();
	}
	
	auto await_ready() noexcept {
        return false;
    }

    auto await_suspend(std::coroutine_handle<> caller) noexcept {
        promise.caller = caller;
        return coroutine_handle::from_promise(promise);
    }

    constexpr auto await_resume() const noexcept {
        return promise.value;
    }
	
	// non-coroutine access to result
	auto get() noexcept {
		const auto handle = coroutine_handle::from_promise(promise);
		
		if (!handle.done()) {
			handle.resume();
		}
		
        return promise.value;
	}
};


auto a() noexcept -> task<int> {
	co_return 42;
}

Note that there is only a coroutine a() in the above example. The frontend generated code (with -g) will consist of 1144 lines of IR codes. I get it by:

clang++ -std=c++20 src.cpp -O3 -S -emit-llvm -g -Xclang -disable-llvm-passes -o frontend-generated.ll

And for the optimized (but before coroutine splitting) IR, it'll still consist of 645 lines of IR code. (I get it by inserting codes in CoroSplit.cpp). So I feel it is too lengthy for a lit test.

BTW, I feel the discussion is a little bit far from the revision itself. I suggest the revision land first if it removes the unnecessary check in coro-debug.ll. Then we can discuss the quality/maintainability for the test of coroutines somewhere else.

Note that there is only a coroutine a() in the above example. The frontend generated code (with -g) will consist of 1144 lines of IR codes. I get it by:
clang++ -std=c++20 src.cpp -O3 -S -emit-llvm -g -Xclang -disable-llvm-passes -o frontend-generated.ll
And for the optimized (but before coroutine splitting) IR, it'll still consist of 645 lines of IR code. (I get it by inserting codes in CoroSplit.cpp). So I feel it is too lengthy for a lit test.

That's with all LLVM passes disabled - it might be interesting if there's something more manageable with some optimization passes applied? (I guess if you don't disable them from clang, -O3 ends up lowering the IR beyond the coroutine abstracitons - so you'd have to do this manually with opt or something to only apply some optimizations and stop before the coroutines were lowered too far/beyond what's needed for this test)

Though, yeah, it still wouldn't surprise me if it's infeasibly long.

That said - I'm still confused about the CHECK-NOT for the label - that looked roughly OK to me/didn't seem too brittle (at least with checking for the end of line, if that's practical)

In D140166#4018476, @dblaikie wrote:
Note that there is only a coroutine a() in the above example. The frontend generated code (with -g) will consist of 1144 lines of IR codes. I get it by:
clang++ -std=c++20 src.cpp -O3 -S -emit-llvm -g -Xclang -disable-llvm-passes -o frontend-generated.ll
And for the optimized (but before coroutine splitting) IR, it'll still consist of 645 lines of IR code. (I get it by inserting codes in CoroSplit.cpp). So I feel it is too lengthy for a lit test.
That's with all LLVM passes disabled - it might be interesting if there's something more manageable with some optimization passes applied? (I guess if you don't disable them from clang, -O3 ends up lowering the IR beyond the coroutine abstracitons - so you'd have to do this manually with opt or something to only apply some optimizations and stop before the coroutines were lowered too far/beyond what's needed for this test)

As far as I know, I don't the method to generate the IR before Coro-splitting automatically. In the past I always tried to insert codes in CoroSplit pass manually. I didn't feel tired for it.

Though, yeah, it still wouldn't surprise me if it's infeasibly long.

That said - I'm still confused about the CHECK-NOT for the label - that looked roughly OK to me/didn't seem too brittle (at least with checking for the end of line, if that's practical)

In the test, we don't care about the inserted instructions. We only care about that the @llvm.dbg.declare lives in the DEFAULT_DEST BB. So it is slightly better to me if we only check the interesting part. The good part if one day someone else made a similar change (insert some instructions to the BB or remove some instructions in the BB), then he probably don't need to worry about the test. Personally, I feel bad when I made a change and I ran the test and many test failed but I found they are not related to my change. So I feel better to add CHECK-NOT to skip these tests.

In D140166#4019070, @ChuanqiXu wrote:
In D140166#4018476, @dblaikie wrote:
Note that there is only a coroutine a() in the above example. The frontend generated code (with -g) will consist of 1144 lines of IR codes. I get it by:
clang++ -std=c++20 src.cpp -O3 -S -emit-llvm -g -Xclang -disable-llvm-passes -o frontend-generated.ll
And for the optimized (but before coroutine splitting) IR, it'll still consist of 645 lines of IR code. (I get it by inserting codes in CoroSplit.cpp). So I feel it is too lengthy for a lit test.
That's with all LLVM passes disabled - it might be interesting if there's something more manageable with some optimization passes applied? (I guess if you don't disable them from clang, -O3 ends up lowering the IR beyond the coroutine abstracitons - so you'd have to do this manually with opt or something to only apply some optimizations and stop before the coroutines were lowered too far/beyond what's needed for this test)
As far as I know, I don't the method to generate the IR before Coro-splitting automatically. In the past I always tried to insert codes in CoroSplit pass manually. I didn't feel tired for it.

Sorry, I didn't follow this - could you rephrase?

Though, yeah, it still wouldn't surprise me if it's infeasibly long.

That said - I'm still confused about the CHECK-NOT for the label - that looked roughly OK to me/didn't seem too brittle (at least with checking for the end of line, if that's practical)

In the test, we don't care about the inserted instructions. We only care about that the @llvm.dbg.declare lives in the DEFAULT_DEST BB. So it is slightly better to me if we only check the interesting part. The good part if one day someone else made a similar change (insert some instructions to the BB or remove some instructions in the BB), then he probably don't need to worry about the test. Personally, I feel bad when I made a change and I ran the test and many test failed but I found they are not related to my change. So I feel better to add CHECK-NOT to skip these tests.

Yeah, roughly following here & I think that's my understanding as well.

@MaskRay Could you describe more/restate your concerns with the particular CHECK-NOT: {{.*}}: you're referring to?

In D140166#4019420, @dblaikie wrote:
In D140166#4019070, @ChuanqiXu wrote:
In D140166#4018476, @dblaikie wrote:
Note that there is only a coroutine a() in the above example. The frontend generated code (with -g) will consist of 1144 lines of IR codes. I get it by:
clang++ -std=c++20 src.cpp -O3 -S -emit-llvm -g -Xclang -disable-llvm-passes -o frontend-generated.ll
And for the optimized (but before coroutine splitting) IR, it'll still consist of 645 lines of IR code. (I get it by inserting codes in CoroSplit.cpp). So I feel it is too lengthy for a lit test.
That's with all LLVM passes disabled - it might be interesting if there's something more manageable with some optimization passes applied? (I guess if you don't disable them from clang, -O3 ends up lowering the IR beyond the coroutine abstracitons - so you'd have to do this manually with opt or something to only apply some optimizations and stop before the coroutines were lowered too far/beyond what's needed for this test)
As far as I know, I don't the method to generate the IR before Coro-splitting automatically. In the past I always tried to insert codes in CoroSplit pass manually. I didn't feel tired for it.
Sorry, I didn't follow this - could you rephrase?

Sorry. I thought you were asking: "if there is an automatic/manageable way we can get the IR applied with some optimizations but before coro-spliting?". Then my reply is "No, I don't know such methods. I always create it by editing the codes."

rebase, format, fix coro-debug.ll test as per @ChuanqiXu

Herald added a subscriber: StephenFan. · View Herald TranscriptJan 10 2023, 11:51 AM

nickdesaulniers marked 5 inline comments as done.Jan 10 2023, 11:52 AM

Harbormaster completed remote builds in B206877: Diff 487918.Jan 10 2023, 2:54 PM

LGTM. Thanks for your patience!

Small point about the CallBr guard, but no other issues from my PoV!

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp
3488–3490	This might be more aggressive than necessary, since if the return type doesn't change or the call instruction has no use, a PHI does not need to be inserted; this could be moderated by moving this to the block at line 3409 (`if (OldRetTy != NewRetTy)`), and checking whether the return value has any uses. With that said, this is a minor point since as you said this won't even be touched at this point, so doesn't need to block merging imo.
3519	See above comment.

nikic added inline comments.Jan 11 2023, 10:03 AM

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp
3488–3490	This entire transform is not relevant for callbr, it only works on calls to functions, not calls to inline asm. You could replace this with `assert(!isa<CallBrInst>(Call))`.

nickdesaulniers planned changes to this revision.Jan 17 2023, 12:39 PM

nickdesaulniers marked an inline comment as done.Jan 17 2023, 12:55 PM

nickdesaulniers added inline comments.

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp
3488–3490	adding the assert causes Transforms/InstCombine/freeze.ll to fail, hitting that assertion.

delay check for callbr inst, as per @StephenTozer

This revision is now accepted and ready to land.Jan 17 2023, 1:06 PM

nikic added inline comments.Jan 17 2023, 1:21 PM

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp
3488–3490	Where did you place the assert? It needs to be after the callee check above, not at the start of the function.

use assert (in correct location) as per @nikic

Harbormaster completed remote builds in B208339: Diff 489950.Jan 17 2023, 5:05 PM

rebase

efriedma accepted this revision.Jan 18 2023, 11:33 AM

Harbormaster completed remote builds in B208556: Diff 490236.Jan 18 2023, 11:49 AM

void accepted this revision.Jan 18 2023, 3:25 PM

rebase, format

MaskRay accepted this revision.Feb 6 2023, 9:44 AM

Harbormaster completed remote builds in B212126: Diff 495166.Feb 6 2023, 10:25 AM

Merged into https://reviews.llvm.org/D135997.

nickdesaulniers mentioned this in rG45a291b5f609: [Dominators] check indirect branches of callbr.Feb 16 2023, 6:04 PM

Revision Contents

Path

Size

llvm/

lib/

IR/

Instruction.cpp

5 lines

Transforms/

InstCombine/

InstCombineCalls.cpp

13 lines

test/

Transforms/

Coroutines/

coro-debug.ll

3 lines

InstCombine/

freeze.ll

2 lines

Reassociate/

callbr.ll

6 lines

Diff 495166

llvm/lib/IR/Instruction.cpp

Show First 20 Lines • Show All 133 Lines • ▼ Show 20 Lines	Instruction *Instruction::getInsertionPointAfterDef() {
BasicBlock::iterator InsertPt;		BasicBlock::iterator InsertPt;
if (auto *PN = dyn_cast<PHINode>(this)) {		if (auto *PN = dyn_cast<PHINode>(this)) {
InsertBB = PN->getParent();		InsertBB = PN->getParent();
InsertPt = InsertBB->getFirstInsertionPt();		InsertPt = InsertBB->getFirstInsertionPt();
} else if (auto *II = dyn_cast<InvokeInst>(this)) {		} else if (auto *II = dyn_cast<InvokeInst>(this)) {
InsertBB = II->getNormalDest();		InsertBB = II->getNormalDest();
InsertPt = InsertBB->getFirstInsertionPt();		InsertPt = InsertBB->getFirstInsertionPt();
} else if (auto *CB = dyn_cast<CallBrInst>(this)) {		} else if (auto *CB = dyn_cast<CallBrInst>(this)) {
InsertBB = CB->getDefaultDest();		// Def is available in multiple successors, there's no single dominating
InsertPt = InsertBB->getFirstInsertionPt();		// insertion point.
		return nullptr;
		nikicUnsubmitted Done Reply Inline Actions Maybe comment on why? (Def is available in multiple successors, there's no single dominating insertion point.) nikic: Maybe comment on why? (Def is available in multiple successors, there's no single dominating…
} else {		} else {
assert(!isTerminator() && "Only invoke/callbr terminators return value");		assert(!isTerminator() && "Only invoke/callbr terminators return value");
InsertBB = getParent();		InsertBB = getParent();
InsertPt = std::next(getIterator());		InsertPt = std::next(getIterator());
}		}

// catchswitch blocks don't have any legal insertion point (because they		// catchswitch blocks don't have any legal insertion point (because they
// are both an exception pad and a terminator).		// are both an exception pad and a terminator).
▲ Show 20 Lines • Show All 768 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp

Show First 20 Lines • Show All 3,471 Lines • ▼ Show 20 Lines Instruction *InstCombinerImpl::visitCallBase(CallBase &Call) {

} }

default: { break; } default: { break; }

} }

return Changed ? &Call : nullptr; return Changed ? &Call : nullptr;

} }

/// If the callee is a constexpr cast of a function, attempt to move the cast to /// If the callee is a constexpr cast of a function, attempt to move the cast to

/// the arguments of the call/callbr/invoke. /// the arguments of the call/invoke.

/// CallBrInst is not supported.

bool InstCombinerImpl::transformConstExprCastCall(CallBase &Call) { bool InstCombinerImpl::transformConstExprCastCall(CallBase &Call) {

auto *Callee = auto *Callee =

dyn_cast<Function>(Call.getCalledOperand()->stripPointerCasts()); dyn_cast<Function>(Call.getCalledOperand()->stripPointerCasts());

if (!Callee) if (!Callee)

return false; return false;

assert(!isa<CallBrInst>(Call) &&

"CallBr's don't have a single point after a def to insert at");

StephenTozerUnsubmitted

Done

This might be more aggressive than necessary, since if the return type doesn't change or the call instruction has no use, a PHI does not need to be inserted; this could be moderated by moving this to the block at line 3409 (if (OldRetTy != NewRetTy)), and checking whether the return value has any uses. With that said, this is a minor point since as you said this won't even be touched at this point, so doesn't need to block merging imo.

StephenTozer: This might be more aggressive than necessary, since if the return type doesn't change or the…

nikicUnsubmitted

Done

This entire transform is not relevant for callbr, it only works on calls to functions, not calls to inline asm. You could replace this with assert(!isa<CallBrInst>(Call)).

nikic: This entire transform is not relevant for callbr, it only works on calls to functions, not…

nickdesaulniersAuthorUnsubmitted

Done

adding the assert causes Transforms/InstCombine/freeze.ll to fail, hitting that assertion.

nickdesaulniers: adding the assert causes Transforms/InstCombine/freeze.ll to fail, hitting that assertion.

nikicUnsubmitted

Done

Where did you place the assert? It needs to be after the callee check above, not at the start of the function.

nikic: Where did you place the assert? It needs to be after the callee check above, not at the start…

// If this is a call to a thunk function, don't remove the cast. Thunks are // If this is a call to a thunk function, don't remove the cast. Thunks are

// used to transparently forward all incoming parameters and outgoing return // used to transparently forward all incoming parameters and outgoing return

// values, so it's important to leave the cast in place. // values, so it's important to leave the cast in place.

if (Callee->hasFnAttribute("thunk")) if (Callee->hasFnAttribute("thunk"))

return false; return false;

// If this is a musttail call, the callee's prototype must match the caller's // If this is a musttail call, the callee's prototype must match the caller's

// prototype with the exception of pointee types. The code below doesn't // prototype with the exception of pointee types. The code below doesn't

Show All 12 Lines bool InstCombinerImpl::transformConstExprCastCall(CallBase &Call) {

Type *OldRetTy = Caller->getType(); Type *OldRetTy = Caller->getType();

Type *NewRetTy = FT->getReturnType(); Type *NewRetTy = FT->getReturnType();

// Check to see if we are changing the return type... // Check to see if we are changing the return type...

if (OldRetTy != NewRetTy) { if (OldRetTy != NewRetTy) {

if (NewRetTy->isStructTy()) if (NewRetTy->isStructTy())

return false; // TODO: Handle multiple return values. return false; // TODO: Handle multiple return values.

StephenTozerUnsubmitted

Done

return false; // TODO: Handle multiple return values.

+ // CallBr's don't have a single point to insert a cast at.

+ if (isa<CallBrInst>(Call) && !Caller->use_empty())

+ return false;

if (!CastInst::isBitOrNoopPointerCastable(NewRetTy, OldRetTy, DL)) {

See above comment.

StephenTozer: See above comment.

if (!CastInst::isBitOrNoopPointerCastable(NewRetTy, OldRetTy, DL)) { if (!CastInst::isBitOrNoopPointerCastable(NewRetTy, OldRetTy, DL)) {

if (Callee->isDeclaration()) if (Callee->isDeclaration())

return false; // Cannot transform this return value. return false; // Cannot transform this return value.

if (!Caller->use_empty() && if (!Caller->use_empty() &&

// void -> non-void is handled specially // void -> non-void is handled specially

!NewRetTy->isVoidTy()) !NewRetTy->isVoidTy())

return false; // Cannot transform this return value. return false; // Cannot transform this return value.

} }

if (!CallerPAL.isEmpty() && !Caller->use_empty()) { if (!CallerPAL.isEmpty() && !Caller->use_empty()) {

AttrBuilder RAttrs(FT->getContext(), CallerPAL.getRetAttrs()); AttrBuilder RAttrs(FT->getContext(), CallerPAL.getRetAttrs());

if (RAttrs.overlaps(AttributeFuncs::typeIncompatible(NewRetTy))) if (RAttrs.overlaps(AttributeFuncs::typeIncompatible(NewRetTy)))

return false; // Attribute not compatible with transformed value. return false; // Attribute not compatible with transformed value.

} }

// If the callbase is an invoke/callbr instruction, and the return value is // If the callbase is an invoke instruction, and the return value is

// used by a PHI node in a successor, we cannot change the return type of // used by a PHI node in a successor, we cannot change the return type of

// the call because there is no place to put the cast instruction (without // the call because there is no place to put the cast instruction (without

// breaking the critical edge). Bail out in this case. // breaking the critical edge). Bail out in this case.

if (!Caller->use_empty()) { if (!Caller->use_empty()) {

BasicBlock *PhisNotSupportedBlock = nullptr; BasicBlock *PhisNotSupportedBlock = nullptr;

if (auto *II = dyn_cast<InvokeInst>(Caller)) if (auto *II = dyn_cast<InvokeInst>(Caller))

PhisNotSupportedBlock = II->getNormalDest(); PhisNotSupportedBlock = II->getNormalDest();

if (auto *CB = dyn_cast<CallBrInst>(Caller))

PhisNotSupportedBlock = CB->getDefaultDest();

if (PhisNotSupportedBlock) if (PhisNotSupportedBlock)

for (User *U : Caller->users()) for (User *U : Caller->users())

if (PHINode *PN = dyn_cast<PHINode>(U)) if (PHINode *PN = dyn_cast<PHINode>(U))

if (PN->getParent() == PhisNotSupportedBlock) if (PN->getParent() == PhisNotSupportedBlock)

return false; return false;

} }

▲ Show 20 Lines • Show All 167 Lines • ▼ Show 20 Lines bool InstCombinerImpl::transformConstExprCastCall(CallBase &Call) {

SmallVector<OperandBundleDef, 1> OpBundles; SmallVector<OperandBundleDef, 1> OpBundles;

Call.getOperandBundlesAsDefs(OpBundles); Call.getOperandBundlesAsDefs(OpBundles);

CallBase *NewCall; CallBase *NewCall;

if (InvokeInst *II = dyn_cast<InvokeInst>(Caller)) { if (InvokeInst *II = dyn_cast<InvokeInst>(Caller)) {

NewCall = Builder.CreateInvoke(Callee, II->getNormalDest(), NewCall = Builder.CreateInvoke(Callee, II->getNormalDest(),

II->getUnwindDest(), Args, OpBundles); II->getUnwindDest(), Args, OpBundles);

} else if (CallBrInst *CBI = dyn_cast<CallBrInst>(Caller)) {

NewCall = Builder.CreateCallBr(Callee, CBI->getDefaultDest(),

CBI->getIndirectDests(), Args, OpBundles);

} else { } else {

NewCall = Builder.CreateCall(Callee, Args, OpBundles); NewCall = Builder.CreateCall(Callee, Args, OpBundles);

cast<CallInst>(NewCall)->setTailCallKind( cast<CallInst>(NewCall)->setTailCallKind(

cast<CallInst>(Caller)->getTailCallKind()); cast<CallInst>(Caller)->getTailCallKind());

} }

NewCall->takeName(Caller); NewCall->takeName(Caller);

NewCall->setCallingConv(Call.getCallingConv()); NewCall->setCallingConv(Call.getCallingConv());

NewCall->setAttributes(NewCallerPAL); NewCall->setAttributes(NewCallerPAL);

▲ Show 20 Lines • Show All 186 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-debug.ll

	Show First 20 Lines • Show All 187 Lines • ▼ Show 20 Lines
	; Check that the dbg.declare intrinsic of invoke instruction is hanled correctly.			; Check that the dbg.declare intrinsic of invoke instruction is hanled correctly.
	; CHECK: %[[ALLOCATED_STORAGE:.+]] = invoke i8* @allocate()			; CHECK: %[[ALLOCATED_STORAGE:.+]] = invoke i8* @allocate()
	; CHECK-NEXT: to label %[[NORMAL_DEST:.+]] unwind			; CHECK-NEXT: to label %[[NORMAL_DEST:.+]] unwind
	; CHECK: [[NORMAL_DEST]]			; CHECK: [[NORMAL_DEST]]
	; CHECK-NEXT: call void @llvm.dbg.declare(metadata i8* %[[ALLOCATED_STORAGE]]			; CHECK-NEXT: call void @llvm.dbg.declare(metadata i8* %[[ALLOCATED_STORAGE]]
	; CHECK: %[[CALLBR_RES:.+]] = callbr i32 asm			; CHECK: %[[CALLBR_RES:.+]] = callbr i32 asm
	; CHECK-NEXT: to label %[[DEFAULT_DEST:.+]] [label			; CHECK-NEXT: to label %[[DEFAULT_DEST:.+]] [label
	; CHECK: [[DEFAULT_DEST]]:			; CHECK: [[DEFAULT_DEST]]:
	; CHECK-NEXT: call void @llvm.dbg.declare(metadata i32 %[[CALLBR_RES]]			; CHECK-NOT: {{.*}}:
				; CHECK: call void @llvm.dbg.declare(metadata i32 %[[CALLBR_RES]]
	; CHECK: define internal fastcc void @f.destroy(%f.Frame* noundef nonnull align 8 dereferenceable(40) %FramePtr) #0 personality i32 0 !dbg ![[DESTROY:[0-9]+]]			; CHECK: define internal fastcc void @f.destroy(%f.Frame* noundef nonnull align 8 dereferenceable(40) %FramePtr) #0 personality i32 0 !dbg ![[DESTROY:[0-9]+]]
				ChuanqiXuUnsubmitted Done Reply Inline Actions We don't care about the inserted checks in the test. It should be fine to check the `llvm.dbg.declare` is in the basic block of `DEFAULT_DEST`. So maybe we can check these 2 lines are not empty or we can check there is no new BB declaration before `llvm.dbg.declare`. ChuanqiXu: We don't care about the inserted checks in the test. It should be fine to check the `llvm.dbg.
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions I'm not sure how best to express that to FileCheck. `; CHECK-NEXT-NOT: {{.}}:` ? nickdesaulniers:* I'm not sure how best to express that to FileCheck. `; CHECK-NEXT-NOT: {{.*}}:` ?
				ChuanqiXuUnsubmitted Done Reply Inline Actions I feel it is a good way to check there is no new BB declaration before `llvm.dbg.declare` ChuanqiXu: I feel it is a good way to check there is no new BB declaration before `llvm.dbg.declare`
				MaskRayUnsubmitted Done Reply Inline Actions Actually I think `utils/update_test_checks.py` may not be a bad choice for this large test file. If it is not time to migrate the test, I think the patch as-is using `; CHECK-NEXT: %1 = load i8, i8* ; ...` looks good. It doesn't appear that there is more maintenance burden than `; CHECK-NEXT-NOT: {{.}}:` MaskRay:* Actually I think `utils/update_test_checks.py` may not be a bad choice for this large test file.
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Curiously, I tried removing existing CHECK lines from this test, then running `./llvm/utils/update_test_checks.py` on this. The result doesn't pass `llvm-lit -vv llvm/test/Transforms/Coroutines/coro-debug.ll` which may be why this test did not use `./llvm/utils/update_test_checks.py` in the first place. nickdesaulniers: Curiously, I tried removing existing CHECK lines from this test, then running `.
				ChuanqiXuUnsubmitted Done Reply Inline Actions Personally I prefer to not use `utils/update_test_checks.py` in coroutine's tests. Since CoroSplit pass will generate many codes and it is pretty hard to read. Currently when I see the coroutine's tests, I can know the pattern of the interesting part easily instead of reading tons of `CHECK` (that's what I feel when I read the test generated by `utils/update_test_checks.py`). And for this patch, it is true that `; CHECK-NEXT: %1 = load i8, i8* ; ...` wouldn't matter a lot. But I still feel better to make the tests focus on the things it want to test. ChuanqiXu: Personally I prefer to not use `utils/update_test_checks.py` in coroutine's tests. Since…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions I never actually tested that `; CHECK-NEXT-NOT: {{.}}:` would be viable. Why I try it: llvm-project/llvm/test/Transforms/Coroutines/coro-debug.ll:196:9: error: unsupported -NOT combo on prefix 'CHECK' nickdesaulniers:* I never actually tested that `; CHECK-NEXT-NOT: {{.*}}:` would be viable. Why I try it: >…
				ChuanqiXuUnsubmitted Done Reply Inline Actions I think we can refactor them into: ; CHECK: [[DEFAULT_DEST]]: ; CHECK-NOT: {{.}}: ; CHECK: call void @llvm.dbg.declare(metadata i32 %[[CALLBR_RES]] Since currently there is no `NEXT` relationships. ChuanqiXu:* I think we can refactor them into: ``` ; CHECK: [[DEFAULT_DEST]]: ; CHECK-NOT: {{.*}}: ; CHECK…
	; CHECK: define internal fastcc void @f.cleanup(%f.Frame* noundef nonnull align 8 dereferenceable(40) %FramePtr) #0 personality i32 0 !dbg ![[CLEANUP:[0-9]+]]			; CHECK: define internal fastcc void @f.cleanup(%f.Frame* noundef nonnull align 8 dereferenceable(40) %FramePtr) #0 personality i32 0 !dbg ![[CLEANUP:[0-9]+]]

	; CHECK: ![[ORIG]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[ORIG]] = distinct !DISubprogram(name: "f", linkageName: "flink"

	; CHECK: ![[RESUME]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[RESUME]] = distinct !DISubprogram(name: "f", linkageName: "flink"
	; CHECK: ![[RESUME_COROHDL]] = !DILocalVariable(name: "coro_hdl", scope: ![[RESUME]]			; CHECK: ![[RESUME_COROHDL]] = !DILocalVariable(name: "coro_hdl", scope: ![[RESUME]]
	; CHECK: ![[RESUME_X]] = !DILocalVariable(name: "x", arg: 1, scope: ![[RESUME]]			; CHECK: ![[RESUME_X]] = !DILocalVariable(name: "x", arg: 1, scope: ![[RESUME]]
	; CHECK: ![[RESUME_DIRECT]] = !DILocalVariable(name: "direct_mem", scope: ![[RESUME]]			; CHECK: ![[RESUME_DIRECT]] = !DILocalVariable(name: "direct_mem", scope: ![[RESUME]]
	; CHECK: ![[RESUME_CONST]] = !DILocalVariable(name: "direct_const", scope: ![[RESUME]]			; CHECK: ![[RESUME_CONST]] = !DILocalVariable(name: "direct_const", scope: ![[RESUME]]
	; CHECK: ![[RESUME_DIRECT_VALUE]] = !DILocalVariable(name: "direct_value", scope: ![[RESUME]]			; CHECK: ![[RESUME_DIRECT_VALUE]] = !DILocalVariable(name: "direct_value", scope: ![[RESUME]]

	; CHECK: ![[DESTROY]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[DESTROY]] = distinct !DISubprogram(name: "f", linkageName: "flink"

	; CHECK: ![[CLEANUP]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[CLEANUP]] = distinct !DISubprogram(name: "f", linkageName: "flink"

llvm/test/Transforms/InstCombine/freeze.ll

	Show First 20 Lines • Show All 447 Lines • ▼ Show 20 Lines

	define i32 @freeze_callbr_use_after_phi(i1 %c) {			define i32 @freeze_callbr_use_after_phi(i1 %c) {
	; CHECK-LABEL: @freeze_callbr_use_after_phi(			; CHECK-LABEL: @freeze_callbr_use_after_phi(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[X:%.*]] = callbr i32 asm sideeffect "", "=r"() #[[ATTR1:[0-9]+]]			; CHECK-NEXT: [[X:%.*]] = callbr i32 asm sideeffect "", "=r"() #[[ATTR1:[0-9]+]]
	; CHECK-NEXT: to label [[CALLBR_CONT:%.*]] []			; CHECK-NEXT: to label [[CALLBR_CONT:%.*]] []
	; CHECK: callbr.cont:			; CHECK: callbr.cont:
	; CHECK-NEXT: [[PHI:%.]] = phi i32 [ [[X]], [[ENTRY:%.]] ], [ 0, [[CALLBR_CONT]] ]			; CHECK-NEXT: [[PHI:%.]] = phi i32 [ [[X]], [[ENTRY:%.]] ], [ 0, [[CALLBR_CONT]] ]
				; CHECK-NEXT: call void @use_i32(i32 [[X]])
	; CHECK-NEXT: [[FR:%.*]] = freeze i32 [[X]]			; CHECK-NEXT: [[FR:%.*]] = freeze i32 [[X]]
	; CHECK-NEXT: call void @use_i32(i32 [[FR]])			; CHECK-NEXT: call void @use_i32(i32 [[FR]])
	; CHECK-NEXT: call void @use_i32(i32 [[FR]])
	; CHECK-NEXT: call void @use_i32(i32 [[PHI]])			; CHECK-NEXT: call void @use_i32(i32 [[PHI]])
	; CHECK-NEXT: br label [[CALLBR_CONT]]			; CHECK-NEXT: br label [[CALLBR_CONT]]
	;			;
	entry:			entry:
	%x = callbr i32 asm sideeffect "", "=r"()			%x = callbr i32 asm sideeffect "", "=r"()
	to label %callbr.cont []			to label %callbr.cont []

	callbr.cont:			callbr.cont:
	▲ Show 20 Lines • Show All 663 Lines • Show Last 20 Lines

llvm/test/Transforms/Reassociate/callbr.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -passes=reassociate < %s \| FileCheck %s			; RUN: opt -S -passes=reassociate < %s \| FileCheck %s

	define i32 @test(i1 %b) {			define i32 @test(i1 %b) {
	; CHECK-LABEL: @test(			; CHECK-LABEL: @test(
	; CHECK-NEXT: [[RES:%.*]] = callbr i32 asm "", "=r,!i"()			; CHECK-NEXT: [[RES:%.*]] = callbr i32 asm "", "=r,!i"()
	; CHECK-NEXT: to label [[NORMAL:%.*]] [label %abnormal]			; CHECK-NEXT: to label [[NORMAL:%.*]] [label %abnormal]
	; CHECK: normal:			; CHECK: normal:
	; CHECK-NEXT: [[FACTOR:%.*]] = mul i32 [[RES]], -2			; CHECK-NEXT: [[RES_NEG:%.*]] = sub i32 0, [[RES]]
	; CHECK-NEXT: [[SUB2:%.*]] = add i32 [[FACTOR]], 5			; CHECK-NEXT: [[SUB1:%.*]] = add i32 [[RES_NEG]], 5
				; CHECK-NEXT: [[RES_NEG1:%.*]] = sub i32 0, [[RES]]
				; CHECK-NEXT: [[SUB2:%.*]] = add i32 [[SUB1]], [[RES_NEG1]]
	; CHECK-NEXT: ret i32 [[SUB2]]			; CHECK-NEXT: ret i32 [[SUB2]]
	; CHECK: abnormal:			; CHECK: abnormal:
	; CHECK-NEXT: ret i32 0			; CHECK-NEXT: ret i32 0
	;			;
	%res = callbr i32 asm "", "=r,!i"()			%res = callbr i32 asm "", "=r,!i"()
	to label %normal [label %abnormal]			to label %normal [label %abnormal]

	normal:			normal:
	%sub1 = sub nsw i32 5, %res			%sub1 = sub nsw i32 5, %res
	%sub2 = sub nsw i32 %sub1, %res			%sub2 = sub nsw i32 %sub1, %res
	ret i32 %sub2			ret i32 %sub2

	abnormal:			abnormal:
	ret i32 0			ret i32 0
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[IR] return nullptr in Instruction::getInsertionPointAfterDef for CallBrInstAbandonedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 495166

llvm/lib/IR/Instruction.cpp

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp

llvm/test/Transforms/Coroutines/coro-debug.ll

llvm/test/Transforms/InstCombine/freeze.ll

llvm/test/Transforms/Reassociate/callbr.ll

[IR] return nullptr in Instruction::getInsertionPointAfterDef for CallBrInst
AbandonedPublic