This is an archive of the discontinued LLVM Phabricator instance.

This is still wrong. The problem isn't the explicit swap that's being changed; it's the reinterpret_cast which implicitly depends on endianness (and breaks strict aliasing).

Probably clearer to explicitly write out the conversion from uint64_t explicitly. For example, instead of uint16_t Word = Words[i];, instead write uint16_t Word = (uint16_t)(Val >> (WordCount * 16));.

Remove reinterpret_cast as suggested by @efriedma
Force conversion native -> little

This patche validates on alla rchitectures supported by fedora, see https://koji.fedoraproject.org/koji/taskinfo?taskID=37835735

How have you been testing this? How many regression test failures are there on s390x? The first version of the patch should not have passed all the regression tests on a big-endian target, I think. (In particular, it should not have worked for 32-bit instructions, like llvm/test/MC/AVR/inst-jmp.s).

Probably want to wait for Dylan to comment, but looks fine. (Changing the code to use support::endian::write isn't necessary, but it's more readable.)

How have you been testing this?

Rebuilding llvm package with AVR support on alla supported arch using Fedora buildsystem

How many regression test failures are there on s390x? The first version of the patch should not have passed all the regression tests on a big-endian target, I think. (In particular, it should not have worked for 32-bit instructions, like llvm/test/MC/AVR/inst-jmp.s).

Yeah, the first version of the patch was not tested correctly (I was explicilty removing AVR from the targets for s390x). When tested correctly, most MC/AVR tests where failing. This is no longer the case (tests activated and pass).

Probably want to wait for Dylan to comment, but looks fine. (Changing the code to use support::endian::write isn't necessary, but it's more readable.)

OK, let's wait for @dylanmckay input for a few days!

Ping.

Nice patch, sorry for taking so long to getting around to this.

@dylanmckay Why is the byte-swapping need here in the first place?

It was quite a long time ago when written, I can't recall this particular logic. I suspect it was a misunderstanding on my part of the format of uint64_t Val.

I've left a couple comments, I'm happy to accept this patch now with the two minor nitpicks fixed.

I've tested this locally and it doesn't cause any regressions, I agree that the reinterpret cast in its prior form makes incorrect assumptions about the host byte ordering, good catch @serge-sans-paille!

llvm/lib/Target/AVR/MCTargetDesc/AVRMCCodeEmitter.cpp
29	Place `#include`s in alphabetical order CodingStandards
275	Nitpick: I recommend adding parentheses around `i * 16` so the reader doesn't have to be conscious about precedence rules to mentally parse this

This revision is now accepted and ready to land.Nov 24 2019, 9:51 PM

Thanks for the review! I'll do the update and merge the patch.

Closed by commit rG29b4d8f19e30: [AVR] Fix endianness handling in AVR MC (authored by serge-sans-paille). · Explain WhyNov 25 2019, 2:43 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

lib/

Target/

AVR/

MCTargetDesc/

AVRMCCodeEmitter.cpp

8 lines

Diff 221462

llvm/lib/Target/AVR/MCTargetDesc/AVRMCCodeEmitter.cpp

Show All 20 Lines
#include "llvm/MC/MCExpr.h"		#include "llvm/MC/MCExpr.h"
#include "llvm/MC/MCFixup.h"		#include "llvm/MC/MCFixup.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
#include "llvm/MC/MCInstrInfo.h"		#include "llvm/MC/MCInstrInfo.h"
#include "llvm/MC/MCRegisterInfo.h"		#include "llvm/MC/MCRegisterInfo.h"
#include "llvm/MC/MCSubtargetInfo.h"		#include "llvm/MC/MCSubtargetInfo.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
		#include "llvm/Support/EndianStream.h"
		dylanmckayUnsubmitted Not Done Reply Inline Actions Place `#include`s in alphabetical order CodingStandards dylanmckay: Place `#include`s in alphabetical order [CodingStandards](https://llvm.org/docs/CodingStandards.

#define DEBUG_TYPE "mccodeemitter"		#define DEBUG_TYPE "mccodeemitter"

#define GET_INSTRMAP_INFO		#define GET_INSTRMAP_INFO
#include "AVRGenInstrInfo.inc"		#include "AVRGenInstrInfo.inc"
#undef GET_INSTRMAP_INFO		#undef GET_INSTRMAP_INFO

namespace llvm {		namespace llvm {
▲ Show 20 Lines • Show All 226 Lines • ▼ Show 20 Lines	unsigned AVRMCCodeEmitter::getMachineOpValue(const MCInst &MI,
assert(MO.isExpr());		assert(MO.isExpr());

return getExprOpValue(MO.getExpr(), Fixups, STI);		return getExprOpValue(MO.getExpr(), Fixups, STI);
}		}

void AVRMCCodeEmitter::emitInstruction(uint64_t Val, unsigned Size,		void AVRMCCodeEmitter::emitInstruction(uint64_t Val, unsigned Size,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
raw_ostream &OS) const {		raw_ostream &OS) const {
const uint16_t Words = reinterpret_cast<uint16_t const >(&Val);
size_t WordCount = Size / 2;		size_t WordCount = Size / 2;

for (int64_t i = WordCount - 1; i >= 0; --i) {		for (int64_t i = WordCount - 1; i >= 0; --i) {
uint16_t Word = Words[i];		uint16_t Word = (Val >> i * 16) & 0xFFFF;
		dylanmckayUnsubmitted Not Done Reply Inline Actions Nitpick: I recommend adding parentheses around `i * 16` so the reader doesn't have to be conscious about precedence rules to mentally parse this dylanmckay: Nitpick: I recommend adding parentheses around `i * 16` so the reader doesn't have to be…
		support::endian::write(OS, Word, support::endianness::little);
OS << (uint8_t) ((Word & 0x00ff) >> 0);
OS << (uint8_t) ((Word & 0xff00) >> 8);
}		}
}		}

void AVRMCCodeEmitter::encodeInstruction(const MCInst &MI, raw_ostream &OS,		void AVRMCCodeEmitter::encodeInstruction(const MCInst &MI, raw_ostream &OS,
SmallVectorImpl<MCFixup> &Fixups,		SmallVectorImpl<MCFixup> &Fixups,
const MCSubtargetInfo &STI) const {		const MCSubtargetInfo &STI) const {
const MCInstrDesc &Desc = MCII.get(MI.getOpcode());		const MCInstrDesc &Desc = MCII.get(MI.getOpcode());

Show All 18 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Fix endianness handling in AVR MCClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 221462

llvm/lib/Target/AVR/MCTargetDesc/AVRMCCodeEmitter.cpp

Fix endianness handling in AVR MC
ClosedPublic