It is important not to regress this behavior, because it allows us to capture the pre-optimization
bitcode and options and replay the full optimization pipeline.
I am not sure what exactly is expected here. What is your definition of pre-optimized bitcode, and how does your test case ensure that? Can you explain a bit more for context?
Pre-optimized means before the LLVM optimization pipeline is run. That's the current implementation, and the test explicitly checks that bar is not inlined into foo.
I could add "alwaysinline" to bar to stress that further.
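For concreteness, a minimal sketch of the kind of test I mean (illustrative only; the RUN line, flags, and function bodies here are stand-ins, not the exact contents of the test in this patch):

  ; RUN: %clang_cc1 -x ir -O2 -fembed-bitcode=all -emit-llvm %s -o - | FileCheck %s

  ; A trivial internal callee that -O2 would normally inline into its caller.
  ; (It could additionally be marked alwaysinline to stress this further.)
  define internal i32 @bar() {
    ret i32 42
  }

  ; The call to bar must survive, i.e. the optimization pipeline was not run.
  ; CHECK-LABEL: define i32 @foo()
  ; CHECK: call i32 @bar()
  define i32 @foo() {
    %r = call i32 @bar()
    ret i32 %r
  }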
I think the current implementation does run optimization passes if the input is a C-family language, and we need to keep it that way (so that we don't do most of the optimization again). The reason you don't see them running is that you are using IR as input. For Apple's implementation, we actually pass -disable-llvm-passes when the input is IR to ensure no optimization passes are run.
Afaik, today's implementation has two parts: the driver and cc1. The cc1 part always captures the bitcode to embed before the optimization passes run. The driver part, upon seeing -fembed-bitcode, splits the compilation into two stages. Stage 1 performs the optimizations and emits bitcode to a file (-emit-llvm); it does not expose -fembed-bitcode to cc1. Stage 2 takes the output of stage 1, disables optimizations, and adds -fembed-bitcode. Together this gives the semantics you mentioned, but it also means that if you skip the driver and pass -fembed-bitcode directly to cc1, you get the pre-optimization bitcode, which is what my scenario needs.
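Concretely, roughly how the two paths compare (a sketch from memory; real cc1 command lines carry many more flags such as -triple, and the exact stage-2 options differ between implementations):

  # Driver path: clang -c -O2 -fembed-bitcode foo.c is split into two cc1 jobs.

  # Stage 1: run the normal optimization pipeline and emit bitcode;
  # -fembed-bitcode is not passed down to this cc1 invocation.
  clang -cc1 -O2 -emit-llvm-bc foo.c -o foo.bc

  # Stage 2: take the stage-1 output, disable further optimization, and embed it.
  clang -cc1 -x ir -disable-llvm-passes -fembed-bitcode=all -emit-obj foo.bc -o foo.o

  # cc1 path: with -fembed-bitcode=all passed directly, the module is captured for
  # embedding before the optimization pipeline runs, so the embedded bitcode is
  # pre-optimization.
  clang -cc1 -O2 -fembed-bitcode=all -emit-obj foo.c -o foo.o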
Ok, I guess we are on the same page. The idea sounds fine to me.
I would suggest checking that the output matches the input file as closely as possible, rather than just checking a label and a call instruction.