This is an archive of the discontinued LLVM Phabricator instance.

Differential D80695

[mlir] Convert raw data in dense element attributes for big-endian machines.
ClosedPublic

Authored by imaihal on May 28 2020, 12:25 AM.

Details

Reviewers

rriddle

Commits

rGa66e334cebec: [mlir] Convert raw data in dense element attributes for big-endian machines.

Summary

This patch fixes bug 46091.

Raw data for dense element attributes is written in little-endian (LE) format.
On a big-endian (BE) machine, this commit converts the data from LE to BE in
`AttributeParser`. When printing on a BE machine, `AsmPrinter` converts the data
from BE back to LE.
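
As an illustration of the idea in the summary, here is a minimal standalone sketch (not the patch's code) of writing element data as little-endian hex regardless of the host byte order. The helper names `toLittleEndianHex` and `fromLittleEndianHex` are hypothetical, and the sketch assumes 32-bit unsigned elements and uppercase hex digits only.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical helper: serialize 32-bit elements as little-endian hex.
// Bytes are extracted with shifts (by numeric significance), so the output
// is identical on little-endian and big-endian hosts.
std::string toLittleEndianHex(const std::vector<uint32_t> &values) {
  static const char digits[] = "0123456789ABCDEF";
  std::string hex;
  for (uint32_t v : values) {
    for (int byte = 0; byte < 4; ++byte) {
      unsigned b = (v >> (8 * byte)) & 0xFFu; // least significant byte first
      hex.push_back(digits[b >> 4]);
      hex.push_back(digits[b & 0xFu]);
    }
  }
  return hex;
}

// Hypothetical helper: parse hex produced above back into host integers.
// Assumes well-formed input made of the characters 0-9 and A-F.
std::vector<uint32_t> fromLittleEndianHex(const std::string &hex) {
  auto nibble = [](char c) -> uint32_t {
    return (c >= '0' && c <= '9') ? c - '0' : c - 'A' + 10;
  };
  std::vector<uint32_t> values;
  for (size_t pos = 0; pos + 8 <= hex.size(); pos += 8) {
    uint32_t v = 0;
    for (int byte = 0; byte < 4; ++byte) {
      uint32_t b = (nibble(hex[pos + 2 * byte]) << 4) |
                   nibble(hex[pos + 2 * byte + 1]);
      v |= b << (8 * byte); // first byte in the text is least significant
    }
    values.push_back(v);
  }
  return values;
}
```

Because neither helper inspects memory layout, text produced on one machine can be read back on a machine with the other byte order.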

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

imaihal created this revision.May 28 2020, 12:25 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 28 2020, 12:25 AM

Herald added subscribers: llvm-commits, jurahul, Kayjukh and 14 others. · View Herald Transcript

Harbormaster failed remote builds in B58173: Diff 266759!May 28 2020, 12:30 AM

Fixed "No newline at end of file"

Harbormaster failed remote builds in B58174: Diff 266760!May 28 2020, 1:02 AM

Rebased with master

Harbormaster completed remote builds in B58228: Diff 266869.May 28 2020, 9:50 AM

imaihal edited the summary of this revision. (Show Details)May 28 2020, 10:10 PM

imaihal edited the summary of this revision. (Show Details)

Could anyone be a reviewer of this patch? This patch is about one of the MLIR tests (dense-elements-hex.mlir).

Herald added a project: Restricted Project. · View Herald TranscriptJun 2 2020, 5:45 PM

@rriddle Hi, is it possible to give me your comments about this patch? I'm asking because you were a reviewer of the previous patch including this file.

Herald added subscribers: msifontes, aartbik. · View Herald TranscriptJun 29 2020, 8:17 PM

Hi, I'm looking for reviewers for this patch. The patch is old and needs to be updated, but first I would like to hear your comments on how this test should pass on big-endian machines. Since the current test includes little-endian hex data, it fails on big-endian machines.
@jpienaar, @ftynse, @mehdi_amini, @rriddle, @Smit: you were reviewers for earlier patches that touched this test. Could you give me your comments?

Rephrasing the question.

The original mlir/test/IR/dense-elements-hex.mlir file assumes that test machines are Little Endian. IBM has big endian machines, and we would like to have a clean run on Big Endian machines as well.

What is your preferred way to handle the situation?

Two different directories with endian-specific files,
two different files in the current directory, or
some way to run the same file with different checks?

We will implement your preferred approach. Thanks for the help.

Could the HEX dump be made independent of the platform? Having the printed format dumped on a BE loadable on a LE and vice-versa would be nicer I think.

In D80695#2251361, @mehdi_amini wrote:

Could the HEX dump be made independent of the platform? Having the printed format dumped on a BE loadable on a LE and vice-versa would be nicer I think.

Thanks for your suggestion. Test code that does not depend on endianness is better. I'm checking other test codes in LLVM. If you know such an example, please let me know. That should be helpful.

I think looking for where the endian::read method is used in the codebase can yield some inspiration, for example: https://github.com/llvm/llvm-project/blob/master/llvm/lib/Remarks/YAMLRemarkParser.cpp#L76

Thanks. I checked the source code. The dense attribute with hex data is parsed here https://github.com/llvm/llvm-project/blob/master/mlir/lib/Parser/AttributeParser.cpp#L682-L684
If we can assume the hex data is always little-endian, it can be converted here, but we can't assume that. So, my understanding is that the hex data of dense-elements-hex.mlir should be generated depending on the endianness. Is this possible? I'm looking for similar examples.

If we can assume the hex data is always little-endian, it can be converted here, but we can't assume that.

Why? We control the printer and parser, don't we?

In D80695#2262458, @mehdi_amini wrote:

If we can assume the hex data is always little-endian, it can be converted here, but we can't assume that.

Why? We control the printer and parser, don't we?

The hex of current dense-elements-hex.mlir is LE. So, we can pass the test by controlling the printer and parser. ( LE to BE in the parser, and BE to LE in the printer)
If we can force users to write LE in the attribute, there is no problem. However, users may write the hex of BE in the attributes in BE machines.
In that case, the parser and printer should not convert the hex, but since they don't know whether the hex in the attribute is BE or LE, I think they cannot process it correctly.

If we can force users to write LE in the attribute, there is no problem. However, users may write the hex of BE in the attributes in BE machines.

I don't understand what you're referring to right now: can you elaborate how a user would write the hex in BE?
I expect users to form the data in memory in BE on a BE machine, and then the printer will always print in LE form.

Just my two cents: LE vs. BE typically causes problems when a program reads data with one data size and writes it with another. So if the reader for the dense attributes respects the type size, namely reads and writes the data using the same format, and then generates the hex directly from the register content, then there are no issues with endianness.

From past optimizations, there are cases where you want to write 64 bits at once but later want to read only 8 bits. Then endianness is an issue, as the program needs to know where to find the right 8 bits among the 64 in memory. But I don't think this is the case here, as the dense attributes are clearly typed and I see no reason not to read and write using the same type.
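
The distinction drawn in this comment can be sketched in a few lines of standalone C++. This is only an illustration with hypothetical function names, not code from the patch: a same-width round trip through memory is safe on any host, while picking a single byte out of a wider value depends on the host byte order.

```cpp
#include <cstdint>
#include <cstring>

// Returns the byte stored at the lowest address of a 64-bit value.
// The result depends on the host byte order.
uint8_t firstByteInMemory(uint64_t value) {
  uint8_t byte;
  std::memcpy(&byte, &value, 1);
  return byte;
}

// Detects the host byte order by probing a known 16-bit value.
bool hostIsLittleEndian() {
  const uint16_t probe = 1;
  uint8_t first;
  std::memcpy(&first, &probe, 1);
  return first == 1;
}

// Writes and reads back with the same width: the value survives
// regardless of the host byte order.
uint64_t roundTripSameWidth(uint64_t value) {
  unsigned char storage[sizeof(uint64_t)];
  std::memcpy(storage, &value, sizeof(value));
  uint64_t result;
  std::memcpy(&result, storage, sizeof(result));
  return result;
}
```

The same-width case stops being safe once the bytes leave the process, for example as printed hex that another machine parses, which is the situation this review is about.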

In D80695#2262569, @mehdi_amini wrote:

I don't understand what you're referring to right now: can you elaborate how a user would write the hex in BE?
I expect users to form the data in memory in BE on a BE machine, and then the printer will always print in LE form.

I thought native byte order should be written in dense.attr. (Users write BE HEX when they use BE machine) If this is not preferable, I think it would be better to describe it in MLIR specification.

The following link is just an example from ONNX.
https://github.com/onnx/onnx/blob/3368834cf0b1f0ab9838cf6bdf78a27299d08187/onnx/onnx.in.proto#L538-L539

In D80695#2265193, @imaihal wrote:

I thought native byte order should be written in dense.attr.

Well that's my point all along: let's not do that and make the serialization/deserialization cross-platform.

Users write BE HEX

Nit: users don't write HEX I believe, the MLIR printer does for them, and we control the printer I believe, at least for the standard types.

In D80695#2266442, @mehdi_amini wrote:

In D80695#2265193, @imaihal wrote:

I thought native byte order should be written in dense.attr.

Well that's my point all along: let's not do that and make the serialization/deserialization cross-platform.

OK. I will implement the conversion, assuming that HEX in dense.attr is always LE. Thanks!

Herald added a subscriber: tatianashp. · View Herald TranscriptSep 14 2020, 5:48 PM

Implemented the conversion from LE to BE in AttributeParser and from BE to LE in AsmPrinter, assuming that HEX in dense.attr is always LE.

Herald added a reviewer: rriddle. · View Herald TranscriptOct 19 2020, 8:52 AM

Herald added subscribers: rdzhabarov, mgorny. · View Herald Transcript

imaihal retitled this revision from [mlir] Added big endian version of "dense-elements-hex.mlir" to [mlir] Convert raw data in dense element attributes into big-endian format..Oct 19 2020, 8:54 AM

imaihal edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B75545: Diff 299073.Oct 19 2020, 9:17 AM

mehdi_amini added inline comments.Oct 19 2020, 8:26 PM

mlir/lib/Support/EndianUtilities.cpp
44 ↗	(On Diff #299073)	I expect a MutableArrayRef here?
52 ↗	(On Diff #299073)	Can you `assert(numElements*storageBitWidth == inRawData.size())` and `inRawData.size() <= outRawData.size()` ?
54 ↗	(On Diff #299073)	It does not work with `const ulittle16_t *`? If so, can you extract the const_cast outside of the sequence of `if` (and make it a switch?): `char *rawDataBegin = const_cast<char *>(inRawData.begin()); switch (storageBitWidth) { case 16: { ulittle16_t *inRawDataPos = reinterpret_cast<ulittle16_t *>(rawDataBegin); uint16_t *outDataPos = reinterpret_cast<uint16_t *>(outRawData.begin()); for (unsigned i = 0, e = numElements; i < e; ++i) std::copy_n(inRawDataPos + i, 1, outDataPos + i); } ...`
59 ↗	(On Diff #299073)	It isn't clear to me right now why you need a loop instead of using the second argument of std::copy_n? std::copy_n(inRawDataPos, numElements, outDataPos);
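
The two forms discussed in these inline comments can be compared in a standalone sketch. `ULittle16` below is a hypothetical stand-in for `llvm::support::ulittle16_t` (two bytes stored in LE order, converted to a host integer on read), and `copyLoop` and `copyOnce` are illustrative names, not functions from the patch.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for llvm::support::ulittle16_t: holds two bytes in
// little-endian order and converts to a host uint16_t when read.
struct ULittle16 {
  unsigned char bytes[2];
  operator uint16_t() const {
    return static_cast<uint16_t>(bytes[0] | (bytes[1] << 8));
  }
};

// Element-by-element form, as in the first version of the patch.
void copyLoop(const ULittle16 *in, uint16_t *out, size_t numElements) {
  for (size_t i = 0; i < numElements; ++i)
    std::copy_n(in + i, 1, out + i);
}

// Single call using the count argument of std::copy_n, as suggested in review.
// Each assignment goes through the conversion operator, so every element is
// decoded from little-endian into the host representation.
void copyOnce(const ULittle16 *in, uint16_t *out, size_t numElements) {
  std::copy_n(in, numElements, out);
}
```

Both functions produce the same output; the single call just lets `std::copy_n` do the iteration.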

Reflected @mehdi_amini's comments.

imaihal marked 4 inline comments as done.Oct 20 2020, 6:29 AM

imaihal added inline comments.

mlir/lib/Support/EndianUtilities.cpp
54 ↗	(On Diff #299073)	`const ulittle16_t *` works. Thanks!
59 ↗	(On Diff #299073)	Sorry, it works.

@mehdi_amini Thanks for your review! I reflected all of your comments.

Harbormaster completed remote builds in B75697: Diff 299349.Oct 20 2020, 6:36 AM

imaihal retitled this revision from [mlir] Convert raw data in dense element attributes into big-endian format. to [mlir] Convert raw data in dense element attributes for big-endian machines..Oct 20 2020, 6:48 AM

imaihal edited the summary of this revision. (Show Details)Oct 20 2020, 7:00 AM

rriddle requested changes to this revision.Oct 20 2020, 10:24 AM

rriddle added inline comments.

mlir/lib/Support/EndianUtilities.cpp
44 ↗	(On Diff #299349)	Can you just make this a utility method on DenseIntOrFpElementsAttr itself? This removes the duplicated code, and also removes the need to break library layering by having Support/ depend on IR/.

This revision now requires changes to proceed.Oct 20 2020, 10:24 AM

Moved the convEndianBE into DenseIntOrFPElementsAttr class and removed EndianUtilities.

Herald added a subscriber: jdoerfert. · View Herald TranscriptOct 20 2020, 10:48 PM

Harbormaster completed remote builds in B75816: Diff 299554.Oct 20 2020, 11:04 PM

Removed unnecessary headers.

imaihal marked an inline comment as done.Oct 20 2020, 11:18 PM

imaihal added inline comments.

mlir/lib/Support/EndianUtilities.cpp
44 ↗	(On Diff #299349)	Thanks for your review. I moved it to `DenseIntOrFPElementsAttr` .

Harbormaster completed remote builds in B75818: Diff 299557.Oct 20 2020, 11:35 PM

rriddle added inline comments.Oct 21 2020, 6:17 PM

mlir/include/mlir/IR/Attributes.h
1145	Can you rename the function to something a bit longer and more descriptive? I don't expect this function to be called very often, if ever by users, so a longer function name isn't that much of a detriment.
mlir/lib/IR/AsmPrinter.cpp
1509	If this machine is already little-endian, can we remove the redundant copy?
1512–1514	This doesn't look correct, you aren't initializing the SmallVector meaning that `data` is not guaranteed to be a valid storage pointer. You also don't need to manually construct a MutableArrayRef, it should be implicitly constructible from SmallVector.
mlir/lib/IR/Attributes.cpp
24	Can you move these inside of convEndianBE?
mlir/lib/Parser/AttributeParser.cpp
700	If this machine is already little-endian, can we remove the redundant copy?
705	Same comments here.

Reflected comments and rebased.

imaihal marked 5 inline comments as done.Oct 26 2020, 7:49 AM

imaihal added inline comments.

mlir/lib/IR/AsmPrinter.cpp
1509	I couldn't find a good way to avoid the redundant copy for little endian without checking system_endianness. The little-endian code path is the same as the original one.

Harbormaster completed remote builds in B76400: Diff 300668.Oct 26 2020, 8:07 AM

imaihal marked an inline comment as done.Oct 26 2020, 5:30 PM

rriddle added inline comments.Oct 26 2020, 5:45 PM

mlir/include/mlir/IR/Attributes.h
1143	Should we even support calling this if the machine is already LE?
mlir/lib/IR/AsmPrinter.cpp
1514	I may be confused right now, but doesn't `convertEndianOfArrayRefForBEmachine` just convert from little to big and not big to little?
mlir/lib/IR/Attributes.cpp
1123	Add an assert that the current machine is big endian here and in the function below?
1154	Is this check necessary anymore?
1156	nit: Just use `size_t` for `i` to remove the cast on `nBytes`. Or switch both to `ssize_t`.
1162	Drop this empty return.
1177	Can you drop the `DenseIntOrFPElementsAttr::` here?
mlir/lib/Parser/AttributeParser.cpp
680	Can you just use these inline and drop the `using`? I don't think it saves much.
725	nit: Drop the else after return here.

@rriddle Thanks for your comments. I answered some of the comments.

mlir/include/mlir/IR/Attributes.h
1143	We don't have to call this function on LE machines, but we can. On an LE machine: `inRawData` LE -> `outRawData` LE. On a BE machine: `inRawData` LE -> `outRawData` BE. As you wrote before, there is a redundant copy when we use this function on an LE machine. To avoid this redundant copy, I think we need to check machine endianness as in this patch.
mlir/lib/IR/AsmPrinter.cpp
1514	This may be confusing, but this also converts big to little on a BE machine. `copy_n` copies `inRawData` (`ulittle`) to `outRawData` (`uint`). This copy assumes `inRawData` is in LE format, so `copy_n` always swaps the byte order on a BE machine, even when the actual `inRawData` is in BE format. Normally this is used when `inRawData` is in LE format, but I reused it to convert BE to LE.
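
The reason one routine can serve both directions is that reversing the bytes of each element is its own inverse. A minimal standalone sketch follows; the helper name `swapElementBytes` is hypothetical, and it assumes the buffer length is a multiple of the element size.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper: reverse the bytes of every fixed-size element.
// Applying it to LE data yields BE data, and applying it again restores LE.
// Assumes in.size() is a multiple of elementBytes.
std::vector<unsigned char>
swapElementBytes(const std::vector<unsigned char> &in, size_t elementBytes) {
  if (elementBytes == 0)
    return in; // nothing to swap; also avoids an endless loop below
  std::vector<unsigned char> out(in.size());
  for (size_t base = 0; base + elementBytes <= in.size(); base += elementBytes)
    std::reverse_copy(in.begin() + base, in.begin() + base + elementBytes,
                      out.begin() + base);
  return out;
}
```

On a BE machine the patch gets the same effect from `copy_n` over `ulittle` types, since decoding "as if LE" swaps the bytes whichever order they were actually in.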

LGTM after resolving the remaining comments. Thanks for pushing on this!

mlir/include/mlir/IR/Attributes.h
1143	Yeah, that's why I'd be okay with asserting inside of these functions. It could prevent some accidental usages of these functions when they shouldn't be used. Dense data gets extremely large.
mlir/lib/IR/AsmPrinter.cpp
1514	Okay, that kind of makes sense now.

This revision is now accepted and ready to land.Oct 26 2020, 7:06 PM

Reflect comments.

Harbormaster completed remote builds in B76550: Diff 300988.Oct 27 2020, 8:14 AM

Rebased.

Harbormaster completed remote builds in B76579: Diff 301019.Oct 27 2020, 10:07 AM

Avoid warning in assert().

Harbormaster completed remote builds in B76671: Diff 301166.Oct 27 2020, 8:03 PM

Avoid warning message.

Harbormaster completed remote builds in B76673: Diff 301169.Oct 27 2020, 8:33 PM

imaihal added inline comments.Oct 27 2020, 8:57 PM

mlir/lib/IR/Attributes.cpp
1128–1129	Inserted `NOLINT` to avoid warning messages in clang-tidy. The warning message suggested to use `static_assert()` here, but it is not appropriate here.

@rriddle Thanks for your review. I don't have commit access. If you don't have other comments, could you commit this patch?

Closed by commit rGa66e334cebec: [mlir] Convert raw data in dense element attributes for big-endian machines. (authored by imaihal, committed by rriddle). · Explain WhyOct 28 2020, 5:13 PM

This revision was automatically updated to reflect the committed changes.

rriddle added a commit: rGa66e334cebec: [mlir] Convert raw data in dense element attributes for big-endian machines..

Revision Contents

Path	Size
mlir/include/mlir/IR/Attributes.h	19 lines
mlir/lib/IR/AsmPrinter.cpp	18 lines
mlir/lib/IR/Attributes.cpp	59 lines
mlir/lib/Parser/AttributeParser.cpp	15 lines
mlir/test/IR/dense-elements-hex.mlir	9 lines

Diff 301484

mlir/include/mlir/IR/Attributes.h

	/// densely packed string arrays.			/// densely packed string arrays.
	class DenseIntOrFPElementsAttr			class DenseIntOrFPElementsAttr
	: public Attribute::AttrBase<DenseIntOrFPElementsAttr, DenseElementsAttr,			: public Attribute::AttrBase<DenseIntOrFPElementsAttr, DenseElementsAttr,
	detail::DenseIntOrFPElementsAttributeStorage> {			detail::DenseIntOrFPElementsAttributeStorage> {

	public:			public:
	using Base::Base;			using Base::Base;

				/// Convert endianess of input ArrayRef for big-endian(BE) machines. All of
				/// the elements of `inRawData` has `type`. If `inRawData` is little endian
				/// (LE), it is converted to big endian (BE). Conversely, if `inRawData` is
				/// BE, converted to LE.
				static void
				convertEndianOfArrayRefForBEmachine(ArrayRef<char> inRawData,
				MutableArrayRef<char> outRawData,
				ShapedType type);

				/// Convert endianess of input for big-endian(BE) machines. The number of
				/// elements of `inRawData` is `numElements`, and each element has
				/// `elementBitWidth` bits. If `inRawData` is little endian (LE), it is
				/// converted to big endian (BE) and saved in `outRawData`. Conversely, if
				/// `inRawData` is BE, converted to LE.
				static void convertEndianOfCharForBEmachine(const char *inRawData,
				char *outRawData,
				size_t elementBitWidth,
				size_t numElements);

	protected:			protected:
	friend DenseElementsAttr;			friend DenseElementsAttr;

	/// Constructs a dense elements attribute from an array of raw APFloat values.			/// Constructs a dense elements attribute from an array of raw APFloat values.
	/// Each APFloat value is expected to have the same bitwidth as the element			/// Each APFloat value is expected to have the same bitwidth as the element
	/// type of 'type'. 'type' must be a vector or tensor with static shape.			/// type of 'type'. 'type' must be a vector or tensor with static shape.
	static DenseElementsAttr getRaw(ShapedType type, size_t storageWidth,			static DenseElementsAttr getRaw(ShapedType type, size_t storageWidth,
	ArrayRef<APFloat> values, bool isSplat);			ArrayRef<APFloat> values, bool isSplat);

mlir/lib/IR/AsmPrinter.cpp

	void ModulePrinter::printDenseIntOrFPElementsAttr(DenseIntOrFPElementsAttr attr,
auto type = attr.getType();		auto type = attr.getType();
auto elementType = type.getElementType();		auto elementType = type.getElementType();

// Check to see if we should format this attribute as a hex string.		// Check to see if we should format this attribute as a hex string.
auto numElements = type.getNumElements();		auto numElements = type.getNumElements();
if (!attr.isSplat() && allowHex &&		if (!attr.isSplat() && allowHex &&
shouldPrintElementsAttrWithHex(numElements)) {		shouldPrintElementsAttrWithHex(numElements)) {
ArrayRef<char> rawData = attr.getRawData();		ArrayRef<char> rawData = attr.getRawData();
os << '"' << "0x" << llvm::toHex(StringRef(rawData.data(), rawData.size()))		if (llvm::support::endian::system_endianness() ==
		llvm::support::endianness::big) {
		// Convert endianess in big-endian(BE) machines. `rawData` is BE in BE
		// machines. It is converted here to print in LE format.
		SmallVector<char, 64> outDataVec(rawData.size());
		MutableArrayRef<char> convRawData(outDataVec);
		DenseIntOrFPElementsAttr::convertEndianOfArrayRefForBEmachine(
		rawData, convRawData, type);
		os << '"' << "0x"
		<< llvm::toHex(StringRef(convRawData.data(), convRawData.size()))
<< "\"";		<< "\"";
		} else {
		os << '"' << "0x"
		<< llvm::toHex(StringRef(rawData.data(), rawData.size())) << "\"";
		}

return;		return;
}		}

if (ComplexType complexTy = elementType.dyn_cast<ComplexType>()) {		if (ComplexType complexTy = elementType.dyn_cast<ComplexType>()) {
Type complexElementType = complexTy.getElementType();		Type complexElementType = complexTy.getElementType();
// Note: The if and else below had a common lambda function which invoked		// Note: The if and else below had a common lambda function which invoked
// printDenseElementsAttrImpl. This lambda was hitting a bug in gcc 9.1,9.2		// printDenseElementsAttrImpl. This lambda was hitting a bug in gcc 9.1,9.2
// and hence was replaced.		// and hence was replaced.

mlir/lib/IR/Attributes.cpp

#include "mlir/IR/Types.h"		#include "mlir/IR/Types.h"
#include "mlir/Interfaces/DecodeAttributesInterfaces.h"		#include "mlir/Interfaces/DecodeAttributesInterfaces.h"
#include "llvm/ADT/Sequence.h"		#include "llvm/ADT/Sequence.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"

using namespace mlir;		using namespace mlir;
using namespace mlir::detail;		using namespace mlir::detail;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AttributeStorage		// AttributeStorage
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

AttributeStorage::AttributeStorage(Type type)		AttributeStorage::AttributeStorage(Type type)
: type(type.getAsOpaquePointer()) {}		: type(type.getAsOpaquePointer()) {}
AttributeStorage::AttributeStorage() : type(nullptr) {}		AttributeStorage::AttributeStorage() : type(nullptr) {}

	DenseIntOrFPElementsAttr::getRawIntOrFloat(ShapedType type, ArrayRef<char> data,
assert(		assert(
::isValidIntOrFloat(type.getElementType(), dataEltSize, isInt, isSigned));		::isValidIntOrFloat(type.getElementType(), dataEltSize, isInt, isSigned));

int64_t numElements = data.size() / dataEltSize;		int64_t numElements = data.size() / dataEltSize;
assert(numElements == 1 || numElements == type.getNumElements());		assert(numElements == 1 || numElements == type.getNumElements());
return getRaw(type, data, /*isSplat=*/numElements == 1);		return getRaw(type, data, /*isSplat=*/numElements == 1);
}		}

		void DenseIntOrFPElementsAttr::convertEndianOfCharForBEmachine(
		const char *inRawData, char *outRawData, size_t elementBitWidth,
		size_t numElements) {
		using llvm::support::ulittle16_t;
		using llvm::support::ulittle32_t;
		using llvm::support::ulittle64_t;

		assert(llvm::support::endian::system_endianness() == // NOLINT
		llvm::support::endianness::big); // NOLINT
		// NOLINT to avoid warning message about replacing by static_assert()

		// Following std::copy_n always converts endianness on BE machine.
		switch (elementBitWidth) {
		case 16: {
		const ulittle16_t *inRawDataPos =
		reinterpret_cast<const ulittle16_t *>(inRawData);
		uint16_t *outDataPos = reinterpret_cast<uint16_t *>(outRawData);
		std::copy_n(inRawDataPos, numElements, outDataPos);
		break;
		}
		case 32: {
		const ulittle32_t *inRawDataPos =
		reinterpret_cast<const ulittle32_t *>(inRawData);
		uint32_t *outDataPos = reinterpret_cast<uint32_t *>(outRawData);
		std::copy_n(inRawDataPos, numElements, outDataPos);
		break;
		}
		case 64: {
		const ulittle64_t *inRawDataPos =
		reinterpret_cast<const ulittle64_t *>(inRawData);
		uint64_t *outDataPos = reinterpret_cast<uint64_t *>(outRawData);
		std::copy_n(inRawDataPos, numElements, outDataPos);
		break;
		}
		default: {
		size_t nBytes = elementBitWidth / CHAR_BIT;
		for (size_t i = 0; i < nBytes; i++)
		std::copy_n(inRawData + (nBytes - 1 - i), numElements, outRawData + i);
		break;
		}
		}
		}

		void DenseIntOrFPElementsAttr::convertEndianOfArrayRefForBEmachine(
		ArrayRef<char> inRawData, MutableArrayRef<char> outRawData,
		ShapedType type) {
		size_t numElements = type.getNumElements();
		Type elementType = type.getElementType();
		if (ComplexType complexTy = elementType.dyn_cast<ComplexType>()) {
		elementType = complexTy.getElementType();
		numElements = numElements * 2;
		}
		size_t elementBitWidth = getDenseElementStorageWidth(elementType);
		assert(numElements * elementBitWidth == inRawData.size() * CHAR_BIT &&
		inRawData.size() <= outRawData.size());
		convertEndianOfCharForBEmachine(inRawData.begin(), outRawData.begin(),
		elementBitWidth, numElements);
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// DenseFPElementsAttr		// DenseFPElementsAttr
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

template <typename Fn, typename Attr>		template <typename Fn, typename Attr>
static ShapedType mappingHelper(Fn mapping, Attr &attr, ShapedType inType,		static ShapedType mappingHelper(Fn mapping, Attr &attr, ShapedType inType,
Type newElementType,		Type newElementType,
llvm::SmallVectorImpl<char> &data) {		llvm::SmallVectorImpl<char> &data) {

mlir/lib/Parser/AttributeParser.cpp

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "Parser.h"		#include "Parser.h"
#include "mlir/IR/AffineMap.h"		#include "mlir/IR/AffineMap.h"
#include "mlir/IR/Dialect.h"		#include "mlir/IR/Dialect.h"
#include "mlir/IR/IntegerSet.h"		#include "mlir/IR/IntegerSet.h"
#include "mlir/IR/StandardTypes.h"		#include "mlir/IR/StandardTypes.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
		#include "llvm/Support/Endian.h"

using namespace mlir;		using namespace mlir;
using namespace mlir::detail;		using namespace mlir::detail;

/// Parse an arbitrary attribute.		/// Parse an arbitrary attribute.
///		///
/// attribute-value ::= `unit`		/// attribute-value ::= `unit`
/// | bool-literal		/// | bool-literal
	DenseElementsAttr TensorLiteralParser::getStringAttr(llvm::SMLoc loc,
}		}

return DenseStringElementsAttr::get(type, stringRefValues);		return DenseStringElementsAttr::get(type, stringRefValues);
}		}

/// Build a Dense attribute with hex data for the given type.		/// Build a Dense attribute with hex data for the given type.
DenseElementsAttr TensorLiteralParser::getHexAttr(llvm::SMLoc loc,		DenseElementsAttr TensorLiteralParser::getHexAttr(llvm::SMLoc loc,
ShapedType type) {		ShapedType type) {
Type elementType = type.getElementType();		Type elementType = type.getElementType();
if (!elementType.isIntOrIndexOrFloat() && !elementType.isa<ComplexType>()) {		if (!elementType.isIntOrIndexOrFloat() && !elementType.isa<ComplexType>()) {
p.emitError(loc)		p.emitError(loc)
<< "expected floating-point, integer, or complex element type, got "		<< "expected floating-point, integer, or complex element type, got "
<< elementType;		<< elementType;
return nullptr;		return nullptr;
}		}

std::string data;		std::string data;
if (parseElementAttrHexValues(p, hexStorage.getValue(), data))		if (parseElementAttrHexValues(p, hexStorage.getValue(), data))
return nullptr;		return nullptr;

ArrayRef<char> rawData(data.data(), data.size());		ArrayRef<char> rawData(data.data(), data.size());
bool detectedSplat = false;		bool detectedSplat = false;
if (!DenseElementsAttr::isValidRawBuffer(type, rawData, detectedSplat)) {		if (!DenseElementsAttr::isValidRawBuffer(type, rawData, detectedSplat)) {
p.emitError(loc) << "elements hex data size is invalid for provided type: "		p.emitError(loc) << "elements hex data size is invalid for provided type: "
<< type;		<< type;
return nullptr;		return nullptr;
}		}

		if (llvm::support::endian::system_endianness() ==
		rriddle: If this machine is already little-endian, can we remove the redundant copy?
		llvm::support::endianness::big) {
		// Convert endianess in big-endian(BE) machines. `rawData` is
		// little-endian(LE) because HEX in raw data of dense element attribute
		// is always LE format. It is converted into BE here to be used in BE
		// machines.
		rriddle: Same comments here.
		SmallVector<char, 64> outDataVec(rawData.size());
		MutableArrayRef<char> convRawData(outDataVec);
		DenseIntOrFPElementsAttr::convertEndianOfArrayRefForBEmachine(
		rawData, convRawData, type);
		return DenseElementsAttr::getFromRawBuffer(type, convRawData,
		detectedSplat);
		}

return DenseElementsAttr::getFromRawBuffer(type, rawData, detectedSplat);		return DenseElementsAttr::getFromRawBuffer(type, rawData, detectedSplat);
}		}

ParseResult TensorLiteralParser::parseElement() {		ParseResult TensorLiteralParser::parseElement() {
switch (p.getToken().getKind()) {		switch (p.getToken().getKind()) {
// Parse a boolean element.		// Parse a boolean element.
case Token::kw_true:		case Token::kw_true:
case Token::kw_false:		case Token::kw_false:
case Token::floatliteral:		case Token::floatliteral:
case Token::integer:		case Token::integer:
storage.emplace_back(/*isNegative=*/false, p.getToken());		storage.emplace_back(/*isNegative=*/false, p.getToken());
p.consumeToken();		p.consumeToken();
		rriddle: nit: Drop the else after return here.
break;		break;

// Parse a signed integer or a negative floating-point element.		// Parse a signed integer or a negative floating-point element.
case Token::minus:		case Token::minus:
p.consumeToken(Token::minus);		p.consumeToken(Token::minus);
if (!p.getToken().isAny(Token::floatliteral, Token::integer))		if (!p.getToken().isAny(Token::floatliteral, Token::integer))
return p.emitError("expected integer or floating point literal");		return p.emitError("expected integer or floating point literal");
storage.emplace_back(/*isNegative=*/true, p.getToken());		storage.emplace_back(/*isNegative=*/true, p.getToken());

mlir/test/IR/dense-elements-hex.mlir

	// RUN: mlir-opt -allow-unregistered-dialect %s -verify-diagnostics -split-input-file -mlir-print-elementsattrs-with-hex-if-larger=1 | FileCheck %s --check-prefix=HEX			// RUN: mlir-opt -allow-unregistered-dialect %s -verify-diagnostics -split-input-file -mlir-print-elementsattrs-with-hex-if-larger=1 | FileCheck %s --check-prefix=HEX
	// RUN: mlir-opt -allow-unregistered-dialect %s -verify-diagnostics -split-input-file | FileCheck %s			// RUN: mlir-opt -allow-unregistered-dialect %s -verify-diagnostics -split-input-file | FileCheck %s

				// HEX: dense<"0x000020410000A040"> : tensor<2xf32>
				"foo.op"() {dense.attr = dense<[10.0, 5.0]> : tensor<2xf32>} : () -> ()

	// HEX: dense<"0x00000000000024400000000000001440"> : tensor<2xf64>			// HEX: dense<"0x00000000000024400000000000001440"> : tensor<2xf64>
	"foo.op"() {dense.attr = dense<[10.0, 5.0]> : tensor<2xf64>} : () -> ()			"foo.op"() {dense.attr = dense<[10.0, 5.0]> : tensor<2xf64>} : () -> ()

				// CHECK: dense<[1.000000e+01, 5.000000e+00]> : tensor<2xf32>
				"foo.op"() {dense.attr = dense<"0x000020410000A040"> : tensor<2xf32>} : () -> ()

	// CHECK: dense<[1.000000e+01, 5.000000e+00]> : tensor<2xf64>			// CHECK: dense<[1.000000e+01, 5.000000e+00]> : tensor<2xf64>
	"foo.op"() {dense.attr = dense<"0x00000000000024400000000000001440"> : tensor<2xf64>} : () -> ()			"foo.op"() {dense.attr = dense<"0x00000000000024400000000000001440"> : tensor<2xf64>} : () -> ()

				// CHECK: dense<(1.000000e+01,5.000000e+00)> : tensor<2xcomplex<f32>>
				"foo.op"() {dense.attr = dense<"0x000020410000A040000020410000A040"> : tensor<2xcomplex<f32>>} : () -> ()

	// CHECK: dense<(1.000000e+01,5.000000e+00)> : tensor<2xcomplex<f64>>			// CHECK: dense<(1.000000e+01,5.000000e+00)> : tensor<2xcomplex<f64>>
	"foo.op"() {dense.attr = dense<"0x0000000000002440000000000000144000000000000024400000000000001440"> : tensor<2xcomplex<f64>>} : () -> ()			"foo.op"() {dense.attr = dense<"0x0000000000002440000000000000144000000000000024400000000000001440"> : tensor<2xcomplex<f64>>} : () -> ()

	// CHECK: dense<[1.000000e+01, 5.000000e+00]> : tensor<2xbf16>			// CHECK: dense<[1.000000e+01, 5.000000e+00]> : tensor<2xbf16>
	"foo.op"() {dense.attr = dense<"0x2041A040"> : tensor<2xbf16>} : () -> ()			"foo.op"() {dense.attr = dense<"0x2041A040"> : tensor<2xbf16>} : () -> ()

	// -----			// -----
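The hex strings in these test cases are the IEEE-754 bit patterns of the elements written least-significant byte first. A small sketch can confirm the `tensor<2xf32>` case; `floatToLittleEndianHex` is a hypothetical helper written for this illustration and is not part of MLIR. It extracts bytes by shifting the integer value, so it produces the same string on little-endian and big-endian hosts.

```cpp
#include <cstdint>
#include <cstring>
#include <string>

// Hypothetical helper, for illustration only: returns the bytes of a 32-bit
// float as uppercase hex, least-significant byte first (little-endian order).
std::string floatToLittleEndianHex(float value) {
  static_assert(sizeof(float) == sizeof(std::uint32_t),
                "this sketch expects a 32-bit float");
  std::uint32_t bits;
  std::memcpy(&bits, &value, sizeof(bits)); // reinterpret without UB
  const char digits[] = "0123456789ABCDEF";
  std::string hex;
  for (int shift = 0; shift < 32; shift += 8) {
    unsigned byte = (bits >> shift) & 0xFFu;
    hex += digits[byte >> 4];
    hex += digits[byte & 0xFu];
  }
  return hex;
}
```

Here 10.0 has the bit pattern 0x41200000 and 5.0 has 0x40A00000, so concatenating the two results gives `000020410000A040`, the payload used in the new `tensor<2xf32>` test.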


Revision Contents

Diff 301484