This is an archive of the discontinued LLVM Phabricator instance.

Differential D77851

[X86][MC] Make -x86-pad-max-prefix-size compatible with --mc-relax-all
ClosedPublic

Authored by skan on Apr 9 2020, 7:45 PM.

Details

Reviewers

LuoYuanke
reames
MaskRay

Commits

rG5d73f79c5478: [X86][MC] Make -x86-pad-max-prefix-size compatible with --mc-relax-all

Summary

We allow non-relaxable instructions to be emitted into a relaxable fragment when prefix padding is used to align branches. So we need to check whether an instruction needs relaxation before relaxing it. Without this patch, using prefix padding together with --mc-relax-all triggers a report_fatal_error in llvm::MCAsmBackend::relaxInstruction.
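
As an editorial illustration of the summary, here is a minimal toy model of the control-flow change. `ToyInst`, `ToyBackend`, the opcode strings, and both helper functions are hypothetical stand-ins invented for this sketch. They are not LLVM classes, and the exception merely imitates the fatal error mentioned above.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>

// Toy stand-in for an instruction: just an opcode name (not llvm::MCInst).
struct ToyInst {
  std::string Opcode;
};

// Toy backend in which only "jmp_short" has a relaxed form.
struct ToyBackend {
  bool mayNeedRelaxation(const ToyInst &I) const {
    return I.Opcode == "jmp_short";
  }
  // Relaxing an instruction without a relaxed form is a hard error,
  // imitating the failure described in the summary.
  void relaxInstruction(const ToyInst &In, ToyInst &Out) const {
    if (!mayNeedRelaxation(In))
      throw std::runtime_error("unexpected opcode: " + In.Opcode);
    Out.Opcode = "jmp_near";
  }
};

// Pre-patch shape: relax once unconditionally, then keep relaxing.
ToyInst relaxAllUnchecked(const ToyBackend &B, const ToyInst &Inst) {
  ToyInst Relaxed;
  B.relaxInstruction(Inst, Relaxed);
  while (B.mayNeedRelaxation(Relaxed))
    B.relaxInstruction(Relaxed, Relaxed);
  return Relaxed;
}

// Post-patch shape: start from a copy and relax only while it is needed.
ToyInst relaxAllChecked(const ToyBackend &B, const ToyInst &Inst) {
  ToyInst Relaxed = Inst;
  while (B.mayNeedRelaxation(Relaxed))
    B.relaxInstruction(Relaxed, Relaxed);
  assert(!B.mayNeedRelaxation(Relaxed) && "loop exits only when fully relaxed");
  return Relaxed;
}
```

In this model, passing a non-relaxable instruction such as `ToyInst{"nop"}` to `relaxAllUnchecked` throws, while `relaxAllChecked` returns it unchanged.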

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

skan created this revision. Apr 9 2020, 7:45 PM

Herald added a project: Restricted Project. Apr 9 2020, 7:45 PM

Herald added subscribers: llvm-commits, hiraditya.

skan edited the summary of this revision. Apr 9 2020, 7:49 PM

Harbormaster failed remote builds in B52619: Diff 256488! Apr 9 2020, 8:08 PM

The failed tests are unrelated. I ran check-all locally and it passed.

skan added a reviewer: LuoYuanke. Apr 9 2020, 11:13 PM

skan added a reviewer: reames. Apr 9 2020, 11:20 PM

LGTM. Better to wait for Philip to accept.

Best to state that this currently triggers a report_fatal_error in llvm::MCAsmBackend::relaxInstruction.

llvm/test/MC/X86/align-branch-64-relax-all.s
2	You can actually strip `-pc-linux-gnu`. ELF is the default.

This revision is now accepted and ready to land. Apr 10 2020, 12:23 AM

skan edited the summary of this revision. Apr 10 2020, 12:39 AM

skan marked an inline comment as done. Apr 10 2020, 12:53 AM

skan added inline comments.

llvm/test/MC/X86/align-branch-64-relax-all.s
2	Thanks for the suggestion. I plan to strip all the `-pc-linux-gnu` and the blank before `# RUN` in the test files align-branch-* after landing this patch.

Nothing changed, just to trigger a new remote build

Harbormaster completed remote builds in B52655: Diff 256541. Apr 10 2020, 4:17 AM

skan edited the summary of this revision. Apr 10 2020, 8:29 PM

Closed by commit rG5d73f79c5478: [X86][MC] Make -x86-pad-max-prefix-size compatible with --mc-relax-all (authored by skan). Apr 10 2020, 9:01 PM

This revision was automatically updated to reflect the committed changes.

niosHD added a subscriber: niosHD. Apr 16 2020, 5:21 AM

It seems that this change breaks the RISCV backend. RISCVAsmBackend::relaxInstruction assumes that the Relaxed parameter is a fresh uninitialized MCInst. With this change, invalid instructions with too many operands are generated. A similar problem probably happens for the Hexagon and for the AMDGPU backend.

/cc @asb also FYI that this breaks the RISCV backend.
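
To make the reported failure mode concrete, the following toy model (an editorial sketch, not code from any LLVM backend) shows how a relaxation routine that assumes an empty output ends up with too many operands when input and output are the same object. `ToyInst`, `relaxAssumingFreshOutput`, and the opcode string are hypothetical names chosen for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-in for an instruction with an operand list (not llvm::MCInst).
struct ToyInst {
  std::string Opcode;
  std::vector<int> Operands;
};

// Written in the style that assumes Out is empty on entry: it sets the
// opcode and appends every input operand to Out.
void relaxAssumingFreshOutput(const ToyInst &In, ToyInst &Out) {
  Out.Opcode = In.Opcode + ".wide";
  // The operand count is read once up front, so the loop terminates even
  // when In and Out are the same object.
  for (std::size_t I = 0, E = In.Operands.size(); I != E; ++I)
    Out.Operands.push_back(In.Operands[I]);
}

// Helper that reproduces the aliased call relax(X, X) and reports how many
// operands the instruction carries afterwards.
std::size_t operandCountAfterAliasedRelax(ToyInst Inst) {
  relaxAssumingFreshOutput(Inst, Inst);
  return Inst.Operands.size();
}
```

With a fresh output the result has the same number of operands as the input. With the aliased call, a two-operand instruction comes back with four, which corresponds to the "too many operands" symptom described in the comment.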

In D77851#1986486, @Razer6 wrote:

It seems that this change breaks the RISCV backend. RISCVAsmBackend::relaxInstruction assumes that the Relaxed parameter is a fresh uninitialized MCInst. With this change, invalid instructions with too many operands are generated. A similar problem probably happens for the Hexagon and for the AMDGPU backend.

/cc @asb also FYI that this breaks the RISCV backend.

Could you provide a new fail case that can be reproduced?

In D77851#1988060, @skan wrote:

In D77851#1986486, @Razer6 wrote:

It seems that this change breaks the RISCV backend. RISCVAsmBackend::relaxInstruction assumes that the Relaxed parameter is a fresh uninitialized MCInst. With this change, invalid instructions with too many operands are generated. A similar problem probably happens for the Hexagon and for the AMDGPU backend.

/cc @asb also FYI that this breaks the RISCV backend.

Could you provide a new fail case that can be reproduced?

Yeah, I got your point. We may need to add a lit test to guarantee this assumption will not be broken.

In D77851#1986486, @Razer6 wrote:

It seems that this change breaks the RISCV backend. RISCVAsmBackend::relaxInstruction assumes that the Relaxed parameter is a fresh uninitialized MCInst. With this change, invalid instructions with too many operands are generated.

We can fix this in two ways:

1. Change the control flow in MCObjectStreamer.cpp, or
2. Create a fresh uninitialized MCInst at the beginning of RISCVAsmBackend::relaxInstruction and, after relaxation, assign its value to Relaxed at the end of RISCVAsmBackend::relaxInstruction.

Which way do you think is better? Personally, I prefer the second one: since getAssembler().getBackend().relaxInstruction(Relaxed, STI, Relaxed) is called in a loop, the assumption that the third argument of relaxInstruction is a fresh uninitialized MCInst is very strange.
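
The second option can be sketched on a toy model as follows. This is an editorial illustration only: `ToyInst` and `relaxViaTemporary` are hypothetical names, and the real backend code is not shown in this thread.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Toy stand-in for an instruction with an operand list (not llvm::MCInst).
struct ToyInst {
  std::string Opcode;
  std::vector<int> Operands;
};

// Build the relaxed form in a local temporary and assign it to Out only at
// the end, so the routine behaves the same whether or not In and Out alias.
void relaxViaTemporary(const ToyInst &In, ToyInst &Out) {
  ToyInst Res;
  Res.Opcode = In.Opcode + ".wide";
  for (int Op : In.Operands)
    Res.Operands.push_back(Op);
  // In is not read after this point, so overwriting an aliased Out is safe.
  Out = std::move(Res);
  assert(!Out.Opcode.empty() && "relaxed form always has an opcode");
}
```

Because all reads of `In` happen before `Out` is written, calling `relaxViaTemporary(X, X)` repeatedly keeps the operand count stable.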

In D77851#1988163, @skan wrote:

In D77851#1986486, @Razer6 wrote:

It seems that this change breaks the RISCV backend. RISCVAsmBackend::relaxInstruction assumes that the Relaxed parameter is a fresh uninitialized MCInst. With this change, invalid instructions with too many operands are generated.

We can fix this in two ways

Change the control flow in MCObjectStreamer.cpp

or create a fresh uninitialized MCInst at the beginning of RISCVAsmBackend::relaxInstruction, and after relaxation, assign its value to Relaxed at the end of RISCVAsmBackend::relaxInstruction.

Which way do you think is better? Personally, I prefer the second one, since getAssembler().getBackend().relaxInstruction(Relaxed, STI, Relaxed) is called in a loop, the assumption that third argument of relaxInstruction is a fresh uninitialized MCInst is very strange.

Special casing RISC-V looks good to me.

In D77851#1988191, @MaskRay wrote:

Which way do you think is better? Personally, I prefer the second one, since getAssembler().getBackend().relaxInstruction(Relaxed, STI, Relaxed) is called in a loop, the assumption that third argument of relaxInstruction is a fresh uninitialized MCInst is very strange.

Special casing RISC-V looks good to me.

A special case is probably also needed for the Hexagon and AMDGPU backends, whose relaxInstruction implementations do something similar to the RISC-V backend's.

Is there any reason why relaxInstruction takes three arguments? Since Relaxed is passed as two arguments that alias the same variable, I would drop the third argument and let relaxInstruction either modify the given instruction in place or assign a fresh one to it (the RISC-V, Hexagon, AMDGPU case).
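
A single-argument, in-place interface of the kind proposed here might look like the toy sketch below. The names (`ToyInst`, `ToyBackend`, `relaxFully`) and opcode strings are hypothetical; this is not the signature or code that was eventually committed.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Toy stand-in for an instruction with an operand list (not llvm::MCInst).
struct ToyInst {
  std::string Opcode;
  std::vector<int> Operands;
};

struct ToyBackend {
  bool mayNeedRelaxation(const ToyInst &Inst) const {
    return Inst.Opcode == "branch.short";
  }
  // One in/out parameter: there is no separate output that could alias the
  // input, so each backend is free to edit in place or rebuild and assign.
  void relaxInstruction(ToyInst &Inst) const {
    assert(mayNeedRelaxation(Inst) && "caller is expected to check first");
    ToyInst Res;
    Res.Opcode = "branch.long";
    Res.Operands = Inst.Operands;
    Inst = std::move(Res);
  }
};

// Caller-side loop: work on a copy and relax it until nothing more is needed.
ToyInst relaxFully(const ToyBackend &B, ToyInst Inst) {
  while (B.mayNeedRelaxation(Inst))
    B.relaxInstruction(Inst);
  return Inst;
}
```

The design point is that the aliasing question disappears from the caller's side, since only one instruction object is ever passed.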

In D77851#1988272, @Razer6 wrote:

In D77851#1988191, @MaskRay wrote:

Which way do you think is better? Personally, I prefer the second one, since getAssembler().getBackend().relaxInstruction(Relaxed, STI, Relaxed) is called in a loop, the assumption that third argument of relaxInstruction is a fresh uninitialized MCInst is very strange.

Special casing RISC-V looks good to me.

A special case is probably also needed for the Hexagon and AMDGPU backends, whose relaxInstruction implementations do something similar to the RISC-V backend's.

Is there any reason why relaxInstruction takes three arguments? Since Relaxed is passed as two arguments that alias the same variable, I would drop the third argument and let relaxInstruction either modify the given instruction in place or assign a fresh one to it (the RISC-V, Hexagon, AMDGPU case).

Dropping the third argument looks good to me.

In D77851#1988272, @Razer6 wrote:

In D77851#1988191, @MaskRay wrote:

Which way do you think is better? Personally, I prefer the second one, since getAssembler().getBackend().relaxInstruction(Relaxed, STI, Relaxed) is called in a loop, the assumption that third argument of relaxInstruction is a fresh uninitialized MCInst is very strange.

Special casing RISC-V looks good to me.

A special case is probably also needed for the Hexagon and AMDGPU backends, whose relaxInstruction implementations do something similar to the RISC-V backend's.

Is there any reason why relaxInstruction takes three arguments? Since Relaxed is passed as two arguments that alias the same variable, I would drop the third argument and let relaxInstruction either modify the given instruction in place or assign a fresh one to it (the RISC-V, Hexagon, AMDGPU case).

I am working on this fix.

skan mentioned this in D78364: [MC][Bugfix] Remove redundant parameter for relaxInstruction. Apr 17 2020, 6:28 AM

In D77851#1988060, @skan wrote:

Could you provide a new fail case that can be reproduced?

https://bugs.llvm.org/show_bug.cgi?id=45580

This broke our RISC-V Linux kernel builds, too.

Has this been reverted? I see discussion of a fix being worked on, but has the change which triggered problems been reverted? I can't tell easily from the review history.

LuoYuanke added inline comments. Apr 17 2020, 6:08 PM

llvm/lib/MC/MCObjectStreamer.cpp
404	If the target expects "Relaxed" to be a fresh MCInst, then this line also has a problem. I think that for RISC-V this line was not executed before, so the problem was not exposed. I prefer to fix the bug in MCObjectStreamer.cpp, because some other targets may also append new operands to the Relaxed MCInst, and the relaxInstruction() API seems to imply that the Relaxed MCInst is fresh:

    while (getAssembler().getBackend().mayNeedRelaxation(Inst, STI)) {
      MCInst Relaxed;
      getAssembler().getBackend().relaxInstruction(Inst, STI, Relaxed);
      Inst = Relaxed;
    }

In D77851#1989744, @reames wrote:

Has this been reverted? I see discussion of a fix being worked on, but has the change which triggered problems been reverted? I can't tell easily from the review history.

No, we plan to fix based on this.

skan marked an inline comment as done. Apr 17 2020, 11:30 PM

skan added inline comments.

llvm/lib/MC/MCObjectStreamer.cpp
404	The code you pasted doesn't make sense: `Inst` is a `const MCInst &`, so it cannot be assigned to.
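
For illustration only, one way to reconcile the two inline comments on a toy model is to copy the const input into a mutable local and give the backend a fresh output on every iteration. `ToyInst`, the width-doubling rule, and all function names below are hypothetical and do not come from the patch.

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Toy stand-in: an encoding width in bits plus an operand list.
struct ToyInst {
  int Width;
  std::vector<int> Operands;
};

// Anything narrower than 32 bits can still be widened in this model.
bool mayNeedRelaxation(const ToyInst &I) { return I.Width < 32; }

// Assumes Out is empty on entry and appends the operands to it.
void relaxInstruction(const ToyInst &In, ToyInst &Out) {
  Out.Width = In.Width * 2;
  for (int Op : In.Operands)
    Out.Operands.push_back(Op);
}

// The parameter stays const; the loop mutates a local copy instead and hands
// the backend a freshly value-initialized output each time around.
ToyInst relaxWithFreshOutput(const ToyInst &Inst) {
  assert(Inst.Width > 0 && "a zero width would never reach 32 by doubling");
  ToyInst Cur = Inst;
  while (mayNeedRelaxation(Cur)) {
    ToyInst Relaxed{}; // Width 0, no operands
    relaxInstruction(Cur, Relaxed);
    Cur = std::move(Relaxed);
  }
  return Cur;
}
```

Starting from an 8-bit, two-operand instruction, the loop runs twice (8 to 16 to 32) and the result still has two operands.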

skan mentioned this in rG8bb059ab6379: [MC][Bugfix] Remove redundant parameter for relaxInstruction. Apr 20 2020, 8:36 PM

Revision Contents

Path                                            Size
llvm/lib/MC/MCObjectStreamer.cpp                3 lines
llvm/test/MC/X86/align-branch-64-relax-all.s    1 line

Diff 256488

llvm/lib/MC/MCObjectStreamer.cpp

(393 lines not shown)

 void MCObjectStreamer::emitInstructionImpl(const MCInst &Inst,

   // Otherwise, relax and emit it as data if either:
   // - The RelaxAll flag was passed
   // - Bundling is enabled and this instruction is inside a bundle-locked
   //   group. We want to emit all such instructions into the same data
   //   fragment.
   if (Assembler.getRelaxAll() ||
       (Assembler.isBundlingEnabled() && Sec->isBundleLocked())) {
-    MCInst Relaxed;
-    getAssembler().getBackend().relaxInstruction(Inst, STI, Relaxed);
+    MCInst Relaxed = Inst;
     while (getAssembler().getBackend().mayNeedRelaxation(Relaxed, STI))
       getAssembler().getBackend().relaxInstruction(Relaxed, STI, Relaxed);
     EmitInstToData(Relaxed, STI);
     return;
   }

   // Otherwise emit to a separate fragment.
   EmitInstToFragment(Inst, STI);
 }

(358 lines not shown)

llvm/test/MC/X86/align-branch-64-relax-all.s

  # RUN: llvm-mc -filetype=obj -triple x86_64-pc-linux-gnu --x86-align-branch-boundary=32 --x86-align-branch=fused+jcc --mc-relax-all %s | llvm-objdump -d --no-show-raw-insn - | FileCheck %s
+ # RUN: llvm-mc -filetype=obj -triple x86_64-pc-linux-gnu --x86-align-branch-boundary=32 --x86-align-branch=fused+jcc --x86-pad-max-prefix-size=5 --mc-relax-all %s | llvm-objdump -d --no-show-raw-insn - | FileCheck %s

  # Check instructions can be aligned correctly along with option --mc-relax-all

  .text
  .global foo
  foo:
  .p2align 5
  .rept 25

(33 lines not shown)