This is an archive of the discontinued LLVM Phabricator instance.


Differential D128588

[AVR] Fix expanding MOVW for overlapping registers
ClosedPublic

Authored by Patryk27 on Jun 25 2022, 10:24 AM.


Details

Reviewers

benshi001

Commits

rG5650688e7242: [AVR] Fix expanding MOVW for overlapping registers

Summary

Sometimes the codegen emits a COPY with overlapping registers, such as
this one:

$r25r24 = COPY $r24r23

Our current expansion of such a COPY is wrong: it copies the low
register first, so the second mov reads a source register that the
first mov has already overwritten:

mov r24, r23
mov r25, r24

(i.e. that's effectively mov r25, r23, which is _ayy ayy bad_.)

This patch improves the expansion by making it detect whether the
registers are overlapping and if so, expanding from the high register
first:

mov r25, r24
mov r24, r23
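To make the difference between the two orderings concrete, here is a minimal C++ sketch that models the register file as a plain byte array. The names `RegFile`, `Pair`, `mov`, `copyLowFirst`, `copyHighFirst` and `readPair` are hypothetical helpers invented for this illustration and are not part of LLVM; only the two instruction orderings come from the summary.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical model of the AVR register file: 32 eight-bit registers.
using RegFile = std::array<std::uint8_t, 32>;

// A 16-bit pair such as r25r24 is identified by its low register index;
// the high register is assumed to be the next one up.
struct Pair {
  unsigned Lo;
  unsigned Hi() const { return Lo + 1; }
};

// Models `mov Rd, Rr`.
inline void mov(RegFile &R, unsigned Rd, unsigned Rr) {
  assert(Rd < R.size() && Rr < R.size());
  R[Rd] = R[Rr];
}

// Old expansion: low byte first, then high byte.
inline void copyLowFirst(RegFile &R, Pair Dest, Pair Src) {
  mov(R, Dest.Lo, Src.Lo);
  mov(R, Dest.Hi(), Src.Hi());
}

// Expansion chosen for the overlapping case: high byte first.
inline void copyHighFirst(RegFile &R, Pair Dest, Pair Src) {
  mov(R, Dest.Hi(), Src.Hi());
  mov(R, Dest.Lo, Src.Lo);
}

// Reads a register pair as one 16-bit value.
inline std::uint16_t readPair(const RegFile &R, Pair P) {
  return static_cast<std::uint16_t>((R[P.Hi()] << 8) | R[P.Lo]);
}
```

With r24r23 holding 0x1234 (r24 = 0x12, r23 = 0x34), `copyLowFirst` into r25r24 leaves 0x3434 in the destination, while `copyHighFirst` leaves the expected 0x1234.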

Because our registers are always paired in descending order (e.g.
there's r25r24, but no r24r25), I think it'd be safe to go with the
high-to-low expansion always (not only if the registers overlap), but
that makes the output a bit more difficult to follow and would require
adjusting the existing tests -- so I went with the safer and simpler
route.
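The choice between the two orderings can be checked with a small brute-force sketch. It assumes a simplified model in which every pair of adjacent registers (Rn+1:Rn) is a valid 16-bit pair, which is not necessarily what the real `DREGS` register class contains; `Order`, `expandCopy` and `countMiscopies` are hypothetical names used only here. `Order::PatchRule` mirrors the `DestLo == SrcHi` condition suggested in review.

```cpp
#include <array>
#include <cstdint>

// Hypothetical model of the AVR register file: 32 eight-bit registers.
using RegFile = std::array<std::uint8_t, 32>;

enum class Order { LowFirst, HighFirst, PatchRule };

// Copies the pair (SrcLo+1:SrcLo) into (DestLo+1:DestLo) with two byte moves.
inline void expandCopy(RegFile &R, unsigned DestLo, unsigned SrcLo, Order O) {
  const unsigned DestHi = DestLo + 1, SrcHi = SrcLo + 1;
  // PatchRule goes high-first only when the low destination byte would
  // clobber the high source byte.
  const bool HighFirst =
      O == Order::HighFirst || (O == Order::PatchRule && DestLo == SrcHi);
  if (HighFirst) {
    R[DestHi] = R[SrcHi];
    R[DestLo] = R[SrcLo];
  } else {
    R[DestLo] = R[SrcLo];
    R[DestHi] = R[SrcHi];
  }
}

// Counts the (Dest, Src) combinations for which the destination pair does
// not end up holding the original source value.
inline unsigned countMiscopies(Order O) {
  unsigned Bad = 0;
  for (unsigned DestLo = 0; DestLo + 1 < 32; ++DestLo) {
    for (unsigned SrcLo = 0; SrcLo + 1 < 32; ++SrcLo) {
      RegFile R;
      for (unsigned I = 0; I < 32; ++I)
        R[I] = static_cast<std::uint8_t>(I + 1); // distinct value per register
      const std::uint8_t WantLo = R[SrcLo], WantHi = R[SrcLo + 1];
      expandCopy(R, DestLo, SrcLo, O);
      if (R[DestLo] != WantLo || R[DestLo + 1] != WantHi)
        ++Bad;
    }
  }
  return Bad;
}
```

Under this model, `countMiscopies(Order::PatchRule)` is zero, while both unconditional orderings miscopy some combinations: low-first when `DestLo == SrcHi`, and high-first in the mirrored case `DestHi == SrcLo` (for example `$r24r23 = COPY $r25r24`). Whether the mirrored case can occur in practice depends on which pairs the real register class allows.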

In the wild, this was found here:
https://github.com/rust-lang/rust/issues/98167.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Patryk27 created this revision. · Jun 25 2022, 10:24 AM

Herald added a project: Restricted Project. · Jun 25 2022, 10:24 AM

Herald added subscribers: Jim, JDevlieghere, hiraditya, dylanmckay.

Patryk27 requested review of this revision. · Jun 25 2022, 10:24 AM

Herald added a project: Restricted Project. · Jun 25 2022, 10:24 AM

Herald added a subscriber: llvm-commits.

Patryk27 added a reviewer: benshi001. · Jun 25 2022, 10:27 AM

Harbormaster completed remote builds in B172032: Diff 439996. · Jun 25 2022, 10:58 AM

benshi001 added inline comments. · Jun 25 2022, 8:43 PM

llvm/lib/Target/AVR/AVRInstrInfo.cpp
61–71	Using the name `CopyHasOverlappingRegs` may be confusing for the case `DestHi == SrcLo`. Though there is an overlap for `r25r24 -> r24r23`, we still have to copy the lower byte first. So I suggest `if (DestLo == SrcHi) { ... } else { ... }`. Also, the long comment is unnecessary, since the logic is clear enough to understand.
llvm/test/CodeGen/AVR/pseudo/COPY.mir
32	Are the two `implicit killed` operands in the two `RCALLk` instructions necessary?

Apply code review changes

Patryk27 marked 2 inline comments as done. · Jun 26 2022, 1:11 AM

Patryk27 added inline comments.

llvm/test/CodeGen/AVR/pseudo/COPY.mir
32	Without any instruction that actually uses the result of that `COPY`, LLVM seems to generate no code whatsoever, so I used `RCALLk` as a simple way of telling LLVM "trust me, that register is read later". The operands don't have to be `killed` though, so to simplify the test, I've changed them to `RCALLk @foo, implicit $r24r23`.

Harbormaster completed remote builds in B172070: Diff 440046. · Jun 26 2022, 2:03 AM

benshi001 accepted this revision. · Jun 26 2022, 2:09 AM

This revision is now accepted and ready to land. · Jun 26 2022, 2:09 AM

This revision was landed with ongoing or failed builds. · Jun 26 2022, 2:32 AM

Closed by commit rG5650688e7242: [AVR] Fix expanding MOVW for overlapping registers (authored by Patryk27, committed by benshi001).

This revision was automatically updated to reflect the committed changes.

benshi001 added a commit: rG5650688e7242: [AVR] Fix expanding MOVW for overlapping registers.

Revision Contents

Path                                       Size
llvm/lib/Target/AVR/AVRInstrInfo.cpp       19 lines
llvm/test/CodeGen/AVR/pseudo/COPY.mir      47 lines
llvm/test/CodeGen/AVR/rust-bug-98167.ll    22 lines

Diff 440049

llvm/lib/Target/AVR/AVRInstrInfo.cpp

 void AVRInstrInfo::copyPhysReg(MachineBasicBlock &MBB,
                                MachineBasicBlock::iterator MI,
                                const DebugLoc &DL, MCRegister DestReg,
                                MCRegister SrcReg, bool KillSrc) const {
   const AVRSubtarget &STI = MBB.getParent()->getSubtarget<AVRSubtarget>();
   const AVRRegisterInfo &TRI = *STI.getRegisterInfo();
   unsigned Opc;
 
-  // Not all AVR devices support the 16-bit `MOVW` instruction.
   if (AVR::DREGSRegClass.contains(DestReg, SrcReg)) {
+    // If our AVR has `movw`, let's emit that; otherwise let's emit two separate
+    // `mov`s.
     if (STI.hasMOVW() && AVR::DREGSMOVWRegClass.contains(DestReg, SrcReg)) {
       BuildMI(MBB, MI, DL, get(AVR::MOVWRdRr), DestReg)
           .addReg(SrcReg, getKillRegState(KillSrc));
     } else {
       Register DestLo, DestHi, SrcLo, SrcHi;
 
       TRI.splitReg(DestReg, DestLo, DestHi);
       TRI.splitReg(SrcReg, SrcLo, SrcHi);
 
-      // Copy each individual register with the `MOV` instruction.
-      BuildMI(MBB, MI, DL, get(AVR::MOVRdRr), DestLo)
-          .addReg(SrcLo, getKillRegState(KillSrc));
-      BuildMI(MBB, MI, DL, get(AVR::MOVRdRr), DestHi)
-          .addReg(SrcHi, getKillRegState(KillSrc));
+      if (DestLo == SrcHi) {
+        BuildMI(MBB, MI, DL, get(AVR::MOVRdRr), DestHi)
+            .addReg(SrcHi, getKillRegState(KillSrc));
+        BuildMI(MBB, MI, DL, get(AVR::MOVRdRr), DestLo)
+            .addReg(SrcLo, getKillRegState(KillSrc));
+      } else {
+        BuildMI(MBB, MI, DL, get(AVR::MOVRdRr), DestLo)
+            .addReg(SrcLo, getKillRegState(KillSrc));
+        BuildMI(MBB, MI, DL, get(AVR::MOVRdRr), DestHi)
+            .addReg(SrcHi, getKillRegState(KillSrc));
+      }
     }
   } else {
     if (AVR::GPR8RegClass.contains(DestReg, SrcReg)) {
       Opc = AVR::MOVRdRr;
     } else if (SrcReg == AVR::SP && AVR::DREGSRegClass.contains(DestReg)) {
       Opc = AVR::SPREAD;
     } else if (DestReg == AVR::SP && AVR::DREGSRegClass.contains(SrcReg)) {
       Opc = AVR::SPWRITE;
     } else {

llvm/test/CodeGen/AVR/pseudo/COPY.mir

This file was added.

# RUN: llc -O0 %s -o - | FileCheck %s

--- |
  target triple = "avr--"

  define void @test_copy_nonoverlapping() {
  entry:
    ret void
  }

  define void @test_copy_overlapping() {
  entry:
    ret void
  }

  declare void @foo(i16 %0)
...

---
name: test_copy_nonoverlapping
tracksRegLiveness: true
body: |
  bb.0.entry:
    liveins: $r25r24

    ; CHECK-LABEL: test_copy_nonoverlapping:
    ; CHECK: mov r22, r24
    ; CHECK-NEXT: mov r23, r25

    $r23r22 = COPY $r25r24
    RCALLk @foo, implicit $r24r23
...

---
name: test_copy_overlapping
tracksRegLiveness: true
body: |
  bb.0.entry:
    liveins: $r24r23

    ; CHECK-LABEL: test_copy_overlapping:
    ; CHECK: mov r25, r24
    ; CHECK-NEXT: mov r24, r23

    $r25r24 = COPY $r24r23
    RCALLk @foo, implicit $r25r24
...

llvm/test/CodeGen/AVR/rust-bug-98167.ll

This file was added.

; RUN: llc < %s -march=avr | FileCheck %s

; The bug can be found here:
; https://github.com/rust-lang/rust/issues/98167
;
; In this test, `extractvalue` + `call` generate a copy with overlapping
; registers (`$r25r24 = COPY $r24r23`) that used to be expanded incorrectly.

define void @main() {
; CHECK-LABEL: main:
; CHECK: rcall foo
; CHECK-NEXT: mov r25, r24
; CHECK-NEXT: mov r24, r23
; CHECK-NEXT: rcall bar
  %1 = call { i8, i16 } @foo()
  %2 = extractvalue { i8, i16 } %1, 1
  call void @bar(i16 %2)
  ret void
}

declare { i8, i16 } @foo()
declare void @bar(i16 %0)