This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Target/MSP430/
-
Target/
-
MSP430/
-
MSP430ISelLowering.cpp
-
test/CodeGen/MSP430/
-
CodeGen/
-
MSP430/
-
longret.ll

Differential D1046

Reverse the order of allocation of return value registers to be compatible with MSPGCC
AbandonedPublic

Authored by jobnoorman on Jun 26 2013, 9:20 AM.

Download Raw Diff

Details

Reviewers

asl

Summary

LLVM uses the exact reverse order of return registers as MSPGCC. This patch fixes this.

Note that I did not find a clean way to do this using tablegen so the method I used might look like a hack :-)

Diff Detail

Event Timeline

The register allocation order is defined by RetCC_MSP430 entry inside MSP430CallingConv.td

Yes, I have been playing around with the calling convention definitions but I think there is no way to match MSPGCC's return value convention. Let me try to explain what I think the problem is.

First and for all, before the ISel phase even begins, all values larger than i16 are lowered (is that the correct terminology?) to a number of i16 values. This means using something like

CCIfType<[i64], CCAssignToReg<[R12W, R13W, R14W, R15W]>>

doesn't do anything since i64 types simply do not exist any more at this point.

So returning a value larger than i16 will result in returning multiple i16 values and I have found no way to convince tablegen to put those in the correct registers.

Here is MSPGCC's return value calling convention (I'll use little endian ordering of the registers):

Type	Registers
i16	r15
i32	r15:r14
i64	r15:r14:r13:r12

Currently, LLVM uses the following rule:

CCIfType<[i16], CCAssignToReg<[R15W, R14W, R13W, R12W]>>

which produces this register assignment:

Type	Registers
i16	r15
i32	r14:r15
i64	r12:r13:r14:r15

Although this is the exact reverse of what MSPGCC does, we cannot simply reverse the tablegen rule to become

CCIfType<[i16], CCAssignToReg<[R12W, R13W, R14W, R15W]>>

since this will produce the following register assignment:

Type	Registers
i16	r12
i32	r13:r12
i64	r15:r14:r13:r12

Therefore, my patch will simply reverse the order of allocated registers after the tablegen rule has been applied which always does the correct thing.

This is hackish solution, because it seems to the fix outcome of the problem, but not the problem by itself. It looks like we need to change the splitting of i32 / i64 here. Because otherwise the difference might be seen in other places.... (what's about e.g. i32 stores?)

I completely agree this is a hackish solution and we need something better. However, I do not think there is a problem with the splitting of i32/i64 since it mostly does what it should do. For example, loads/stores of these values work correctly since codegen knows we are little endian.

Also, for the calling convention, I think codegen is doing the "logical" thing. Return values are assigned to R15-R12 and codegen assigns the return value in little endian byte order to these registers (R15=LSB, r12=MSB). The thing is that the calling convention used by MSPGCC is kind of weird: registers should also be picked in the order R15-R12 but then the actual return value should be assigned to these registers in big endian byte order (R15=MSB, R12=LSB). Also note that exactly the same happens for i32/i64 arguments. This is actually a bigger problem since my hack won't work for arguments :-)

After looking a bit longer at the code, I don't really see a clean way to solve this problem. Do you think it it might be a good option to implement the calling convention "by hand"? That is, without using tablegen.

This revision is obsoleted by D1086.

Revision Contents

Path

Size

lib/

Target/

MSP430/

MSP430ISelLowering.cpp

19 lines

test/

CodeGen/

MSP430/

longret.ll

31 lines

Diff 2577

lib/Target/MSP430/MSP430ISelLowering.cpp

Context not available.

	#include "MSP430GenCallingConv.inc"	#include "MSP430GenCallingConv.inc"

		template<typename IOArg>
		static void UpdateReturnAnalysis(SmallVectorImpl<CCValAssign>& RVLocs,
		const SmallVectorImpl<IOArg> &Args) {
		unsigned Size = Args.size();
		if (Size <= 1)
		return;

		for (unsigned i = 0, e = Size / 2; i != e; ++i) {
		CCValAssign &Left = RVLocs[i];
		CCValAssign &Right = RVLocs[Size - i - 1];
		CCValAssign Tmp = Left;
		Left = Right;
		Right = Tmp;
		}
		}

	SDValue	SDValue
	MSP430TargetLowering::LowerFormalArguments(SDValue Chain,	MSP430TargetLowering::LowerFormalArguments(SDValue Chain,
	CallingConv::ID CallConv,	CallingConv::ID CallConv,
Context not available.

	// Analize return values.	// Analize return values.
	CCInfo.AnalyzeReturn(Outs, RetCC_MSP430);	CCInfo.AnalyzeReturn(Outs, RetCC_MSP430);
		UpdateReturnAnalysis(RVLocs, Outs);

	SDValue Flag;	SDValue Flag;
	SmallVector<SDValue, 4> RetOps(1, Chain);	SmallVector<SDValue, 4> RetOps(1, Chain);
Context not available.
	getTargetMachine(), RVLocs, *DAG.getContext());	getTargetMachine(), RVLocs, *DAG.getContext());

	CCInfo.AnalyzeCallResult(Ins, RetCC_MSP430);	CCInfo.AnalyzeCallResult(Ins, RetCC_MSP430);
		UpdateReturnAnalysis(RVLocs, Ins);

	// Copy all of the result registers out of their specified physreg.	// Copy all of the result registers out of their specified physreg.
	for (unsigned i = 0; i != RVLocs.size(); ++i) {	for (unsigned i = 0; i != RVLocs.size(); ++i) {
Context not available.
	return false;	return false;
	}	}


	const char *MSP430TargetLowering::getTargetNodeName(unsigned Opcode) const {	const char *MSP430TargetLowering::getTargetNodeName(unsigned Opcode) const {
	switch (Opcode) {	switch (Opcode) {
	default: return NULL;	default: return NULL;
Context not available.

test/CodeGen/MSP430/longret.ll

This file was added.

				; RUN: llc < %s \| FileCheck %s

				target datalayout = "e-p:16:16:16-i8:8:8-i16:16:16-i32:16:32-n8:16"
				target triple = "msp430---elf"

				@var = common global i64 0, align 8

				define i64 @callee() #0 {
				entry:
				; CHECK: callee:
				; CHECK: mov.w #1800, r12
				; CHECK: mov.w #1286, r13
				; CHECK: mov.w #772, r14
				; CHECK: mov.w #258, r15
				; CHECK: ret
				ret i64 72623859790382856
				}

				define void @caller() #0 {
				; CHECK: caller:
				; CHECK: call #callee
				%1 = call i64 @callee()
				; CHECK: mov.w r15, &var+6
				; CHECK: mov.w r14, &var+4
				; CHECK: mov.w r13, &var+2
				; CHECK: mov.w r12, &var
				store i64 %1, i64* @var, align 8
				ret void
				}

				attributes #0 = { nounwind "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf"="true" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "unsafe-fp-math"="false" "use-soft-float"="false" }