This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/RISCV/
-
Target/
-
RISCV/
1/3
RISCV.td
4/10
RISCVISelLowering.cpp
-
RISCVSubtarget.h
-
test/CodeGen/RISCV/
-
CodeGen/
-
RISCV/
-
unaligned-load-store.ll

Differential D126085

[RISCV] Add a subtarget feature to enable unaligned scalar loads and stores
ClosedPublic

Authored by reames on May 20 2022, 12:32 PM.

Download Raw Diff

Details

Reviewers

craig.topper
asb
frasercrmck
kito-cheng

Commits

rG8a3b6ba7569e: [RISCV] Add a subtarget feature to enable unaligned scalar loads and stores

Summary

A RISCV implementation can choose to implement unaligned load/store support. We currently don't have a way for such a processor to indicate a preference for unaligned load/stores, so add a subtarget feature.

There doesn't appear to be a formal extension for unaligned support. The RISCV Profiles (https://github.com/riscv/riscv-profiles/blob/main/profiles.adoc#rva20u64-profile) docs use the name Zicclsm, but a) that doesn't appear to actually been standardized, and b) isn't quite what we want here anyway due to the perf comment.

Instead, we can follow precedent from other backends and have a feature flag for the existence of misaligned load/stores with sufficient performance that user code should actually use them.

Diff Detail

Unit TestsFailed

	Time	Test
	60,490 ms	x64 debian > LLVM.tools/llvm-isel-fuzzer::aarch64-execname-options.ll
	60,540 ms	x64 debian > LLVM.tools/llvm-isel-fuzzer::execname-options.ll
	60,190 ms	x64 debian > LLVM.tools/llvm-objcopy/ELF::auto-remove-add-symtab-shndx.test
	60,280 ms	x64 debian > LLVM.tools/llvm-opt-fuzzer::exec-options.ll
	60,090 ms	x64 debian > LeakSanitizer-Standalone-x86_64.TestCases::many_threads_detach.cpp
		View Full Test Results (10 Failed)

Event Timeline

reames created this revision.May 20 2022, 12:32 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 20 2022, 12:32 PM

Herald added subscribers: sunshaoce, VincentWu, luke957 and 29 others. · View Herald Transcript

reames requested review of this revision.May 20 2022, 12:32 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 20 2022, 12:32 PM

Herald added subscribers: • pcwang-thead, eopXD, MaskRay. · View Herald Transcript

craig.topper added inline comments.May 20 2022, 12:57 PM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11820	This seems like it should be a different feature for vectors.

reames added inline comments.May 20 2022, 1:09 PM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11820	There might be such a target where scalar vs vector matters, but on the motivating case, this is not expected to matter. We can split later if needed.

Harbormaster completed remote builds in B165564: Diff 431026.May 20 2022, 1:10 PM

Is this meant to be "it works" or "it works without trapping for emulation"? Pretty much every EEI out there has misaligned accesses guaranteed to work, just not quickly, and in those cases you'd still want to avoid them as the inlined byte-wise code is far faster.

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11810	If returning true and Fast isn't null we should be setting it; some users pre-initialise it, but not all (I found one in GlobalISel and stopped looking)

craig.topper added inline comments.May 20 2022, 1:14 PM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11820	There are no vector tests and the description in the .td file says “scalar”. The patch should at least be self consistent.

reames added inline comments.May 20 2022, 1:22 PM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11810	Good catch. I will unconditionally initialize to false to make sure we don't miss any paths. Thanks!
11820	You're absolutely correct on that. I'd originally had it split, then changed my mind. Let me rebase and correct both of those!

craig.topper added inline comments.May 20 2022, 1:39 PM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11816	Not directly related to this patch, but it could be argued that *Fast = true is incorrect for vectors here. Being element aligned doesn't guarantee optimal access if you're splitting a cache line or other alignment requirements of your memory unit.

In D126085#3528394, @jrtc27 wrote:

Is this meant to be "it works" or "it works without trapping for emulation"? Pretty much every EEI out there has misaligned accesses guaranteed to work, just not quickly, and in those cases you'd still want to avoid them as the inlined byte-wise code is far faster.

From what I can tell looking at existing targets, the three states for unaligned are basically.

"not present" - either the hardware doesn't support, or performance is unreasonable
"present" - hardware is present, and reasonable performant. unaligned accesses in source are lowered as such.
"fast" - hardware is present, optimizer should use when doesn't exist in code already

I tried to follow that convention with this change.

address review comments

reames added inline comments.May 20 2022, 2:32 PM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11820	It turns out writing vector tests was a useful exercise. :) From what I can tell, we aggressively canonicalize towards byte element loads and stores for vectors. As such, no plain vector load or store is every treated as unaligned. However, some indexed and strided load tests did fail if I asserted the fallthrough return was never taken. For the moment, I decided to restrict the scope to only the scalar case. Mostly because there look to be enough other opportunities around indexed loads that this didn't seem terribly useful. With that, I could go in two directions here. Rename this to scalar and add a separate flag later for vector cases as warranted. Document the intent that this covers all access, but leave the indexed/strided case currently unimplemented. I have no strong preference and will do as reviewers request.

Harbormaster completed remote builds in B165587: Diff 431056.May 20 2022, 3:17 PM

asb added inline comments.May 25 2022, 7:31 AM

llvm/lib/Target/RISCV/RISCV.td
435	Nit: reasonable => reasonably

reames marked an inline comment as not done.May 25 2022, 11:01 AM

reames added inline comments.

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11820	@craig.topper ping?

craig.topper added inline comments.May 25 2022, 11:44 AM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
11820	Let's rename it to scalar and add a separate flag for vector later as warranted.

Revise per reviewer request

craig.topper added inline comments.May 25 2022, 12:07 PM

llvm/lib/Target/RISCV/RISCV.td
435	asb's comment still needs to be addressed.

reames added inline comments.May 25 2022, 12:34 PM

llvm/lib/Target/RISCV/RISCV.td
435	Oops, yep, will rev.

address missed review comment

I don't think it affects this patch (as splitting scalar vs vector features for alignment makes sense regardless of the frontend option), but I had a quick look at the frontend option for this. Looks like on the GCC side it's -mno-strict-align. On Arm/AArch64 it's spelled -munaligned-access and __ARM_FEATURE_UNALIGNED is set. Unless there's some target generic #define that's set that I'm missing, it feels like it would be useful to set a define for the RISC-V case as well (and agree this with the GNU folks).

One thing I noticed that does affect this patch (and sorry I didn't spot this earlier!), is we should be setting Tag_RISCV_unaligned_access when this feature is set (see here).

In D126085#3538293, @asb wrote:

I don't think it affects this patch (as splitting scalar vs vector features for alignment makes sense regardless of the frontend option), but I had a quick look at the frontend option for this. Looks like on the GCC side it's -mno-strict-align. On Arm/AArch64 it's spelled -munaligned-access and __ARM_FEATURE_UNALIGNED is set. Unless there's some target generic #define that's set that I'm missing, it feels like it would be useful to set a define for the RISC-V case as well (and agree this with the GNU folks).

I agree this is separate. What a target defaults to should not be blocked on what a frontend might chose to override. I'll note that these flags don't appear to influence x86 handling of the corresponding cases, so unless this is handled somewhere generically I missed, this isn't RISCV specific in any way.

One thing I noticed that does affect this patch (and sorry I didn't spot this earlier!), is we should be setting Tag_RISCV_unaligned_access when this feature is set (see here).

Er, I'm sorry, what? Why?

I agree this spec exists, but what purpose does the attribute bit actually serve? There's no interaction with the dynamic loader here. It's a compiler codegen choice.

If you really want, I'm fine adding the attribute wiring (in a separate patch), but this really looks like a spec that is a) out of sync with reality, and b) serves no obvious purpose.

In D126085#3538325, @reames wrote:

In D126085#3538293, @asb wrote:

I don't think it affects this patch (as splitting scalar vs vector features for alignment makes sense regardless of the frontend option), but I had a quick look at the frontend option for this. Looks like on the GCC side it's -mno-strict-align. On Arm/AArch64 it's spelled -munaligned-access and __ARM_FEATURE_UNALIGNED is set. Unless there's some target generic #define that's set that I'm missing, it feels like it would be useful to set a define for the RISC-V case as well (and agree this with the GNU folks).

I agree this is separate. What a target defaults to should not be blocked on what a frontend might chose to override. I'll note that these flags don't appear to influence x86 handling of the corresponding cases, so unless this is handled somewhere generically I missed, this isn't RISCV specific in any way.

Yep, just mentioning for future reference.

One thing I noticed that does affect this patch (and sorry I didn't spot this earlier!), is we should be setting Tag_RISCV_unaligned_access when this feature is set (see here).

Er, I'm sorry, what? Why?

I agree this spec exists, but what purpose does the attribute bit actually serve? There's no interaction with the dynamic loader here. It's a compiler codegen choice.

If you really want, I'm fine adding the attribute wiring (in a separate patch), but this really looks like a spec that is a) out of sync with reality, and b) serves no obvious purpose.

I'm not sure the attribute is used in the wild in any meaningful way right now, but I think it does have purpose. There's no guarantee that a RISC-V EEI actually provides a trap handler for misaligned accesses, so on some targets (largely embedded), compiling with +unaligned-scalar-mem may result in code that is unsuitable. Given there's no way to describe support for misaligned accesses in the ISA naming string, it's probably similarly useful to the other attributes from the perspective of any tooling working with ELF files. Such tooling might be used to warn / error when a likely build misconfiguration is found within a project.

If I were authoring the patch, I'd likely put the attribute wiring in the same patch (or perhaps if I really wanted to split things, have patch 1 adding the feature but only impacting the attribute, and patch 2 adding codegen tests and changes). But tastes vary and I don't think it makes a big difference for this patch, so I'm happy with whatever sequencing you prefer here.

Harbormaster completed remote builds in B166337: Diff 432085.May 25 2022, 1:53 PM

Technically it's may not will so you can just hard-wire it to 1...

This is all that's needed to hook up the appropriate attribute:

diff --git a/llvm/lib/Target/RISCV/MCTargetDesc/RISCVTargetStreamer.cpp b/llvm/lib/Target/RISCV/MCTargetDesc/RISCVTargetStreamer.cpp
index 5f9ed77d07cf..ac0c8113135a 100644
--- a/llvm/lib/Target/RISCV/MCTargetDesc/RISCVTargetStreamer.cpp
+++ b/llvm/lib/Target/RISCV/MCTargetDesc/RISCVTargetStreamer.cpp
@@ -50,6 +50,9 @@ void RISCVTargetStreamer::emitTargetAttributes(const MCSubtargetInfo &STI) {
   else
     emitAttribute(RISCVAttrs::STACK_ALIGN, RISCVAttrs::ALIGN_16);
 
+  if (STI.hasFeature(RISCV::FeatureUnalignedScalarMem))
+    emitAttribute(RISCVAttrs::UNALIGNED_ACCESS, RISCVAttrs::ALLOWED);
+
   auto ParseResult = RISCVFeatures::parseFeatureBits(
       STI.hasFeature(RISCV::Feature64Bit), STI.getFeatureBits());
   if (!ParseResult) {
diff --git a/llvm/test/CodeGen/RISCV/attributes.ll b/llvm/test/CodeGen/RISCV/attributes.ll
index 9d9f02ce52cb..0734b5a01b45 100644
--- a/llvm/test/CodeGen/RISCV/attributes.ll
+++ b/llvm/test/CodeGen/RISCV/attributes.ll
@@ -147,6 +147,14 @@
 ; RV64COMBINEINTOZKN: .attribute 5, "rv64i2p0_zbkb1p0_zbkc1p0_zbkx1p0_zkn1p0_zknd1p0_zkne1p0_zknh1p0"
 ; RV64COMBINEINTOZKS: .attribute 5, "rv64i2p0_zbkb1p0_zbkc1p0_zbkx1p0_zks1p0_zksed1p0_zksh1p0"
 
+; RUN: llc -mtriple=riscv32 %s -o - | FileCheck --check-prefix=ALIGNED %s
+; RUN: llc -mtriple=riscv64 %s -o - | FileCheck --check-prefix=ALIGNED %s
+; RUN: llc -mtriple=riscv32 -mattr=+unaligned-scalar-mem %s -o - | FileCheck --check-prefix=UNALIGNED %s
+; RUN: llc -mtriple=riscv64 -mattr=+unaligned-scalar-mem %s -o - | FileCheck --check-prefix=UNALIGNED %s
+
+; ALIGNED-NOT: .attribute 6
+; UNALIGNED: .attribute 6, 1
+
 define i32 @addi(i32 %a) {
   %1 = add i32 %a, 1
   ret i32 %1

I think not emitting Tag_RISCV_unaligned_access in the absence of +unaligned-scalar-mem (as the above patch does) is probably the best we can do, as inline assembly or handwritten .s files may well include misaligned accesses, even without +unaligned-scalar-mem.

For the patch under review, ALIGN and NOALIGN are (to my eyes) reversed in meaning. i.e. I'd expect NOALIGN to cover RUN invocations with -mattr=+unaligned-scalar-mem. But maybe there's an alternate interpretation?

The topic of Tag_RISCV_unaligned_access handling was discussed in today's risc-v sync up call. The conclusion of that discussion is that we're going to leave it unset in this patch, and that I'm going to file an issue against the psABI to clarify a couple points in the spec. Once that clarification has been done, we may revisit setting the tag.

In the short run, we believe that leaving the tag in the unset state is the lowest risk strategy, and the least likely to break assumptions which may have been made about the meaning of the tag.

As just discussed in the sync-up call meeting, let's defer the question of setting / not setting the attribute to an upstream discussion in riscv-elf-psabi-doc on the precise semantics of the tag. Philip kindly volunteered to kick off an issue in that repo.

I don't think setting the tag or not needs to block this landing (though the degree to which ELF tags are opt-in is perhaps something that could be further clarified in the psABI doc too).

(EDIT: Ah, Philip beat me to it!)

Address test check comment.

Anything remaining blocking this? I know I need to file the psABI issue and will do so before pushing. Asking about further review items.

In D126085#3540577, @reames wrote:

Address test check comment.

Anything remaining blocking this? I know I need to file the psABI issue and will do so before pushing. Asking about further review items.

Nothing remaining from me - LGTM.

This revision is now accepted and ready to land.May 26 2022, 10:17 AM

Harbormaster completed remote builds in B166505: Diff 432325.May 26 2022, 11:07 AM

Here is the relevant psABI issue: https://github.com/riscv-non-isa/riscv-elf-psabi-doc/issues/283

This revision was landed with ongoing or failed builds.May 26 2022, 3:56 PM

Closed by commit rG8a3b6ba7569e: [RISCV] Add a subtarget feature to enable unaligned scalar loads and stores (authored by reames). · Explain Why

This revision was automatically updated to reflect the committed changes.

reames added a commit: rG8a3b6ba7569e: [RISCV] Add a subtarget feature to enable unaligned scalar loads and stores.

How to enable this feature through clang? I tried this:

$ ./bin/clang -S t.c -O2 -mllvm -mattr=+unaligned-scalar-mem

But get:
clang (LLVM option parsing): Unknown command line argument '-mattr=+unaligned-scalar-mem'. Try: 'clang (LLVM option parsing) --help'
clang (LLVM option parsing): Did you mean '--mv67t=+unaligned-scalar-mem'?

In D126085#3556502, @BaoshanPang wrote:

How to enable this feature through clang? I tried this:

$ ./bin/clang -S t.c -O2 -mllvm -mattr=+unaligned-scalar-mem

But get:
clang (LLVM option parsing): Unknown command line argument '-mattr=+unaligned-scalar-mem'. Try: 'clang (LLVM option parsing) --help'
clang (LLVM option parsing): Did you mean '--mv67t=+unaligned-scalar-mem'?

Try -Xclang -target-feature -Xclang +unaligned-scalar-mem

In D126085#3556528, @craig.topper wrote:

In D126085#3556502, @BaoshanPang wrote:

How to enable this feature through clang? I tried this:

$ ./bin/clang -S t.c -O2 -mllvm -mattr=+unaligned-scalar-mem

But get:
clang (LLVM option parsing): Unknown command line argument '-mattr=+unaligned-scalar-mem'. Try: 'clang (LLVM option parsing) --help'
clang (LLVM option parsing): Did you mean '--mv67t=+unaligned-scalar-mem'?

Try -Xclang -target-feature -Xclang +unaligned-scalar-mem

Thanks for the qucik response, it works for me.

reames mentioned this in D149375: [RISCV] Introduce unaligned-vector-mem feature.Apr 27 2023, 12:22 PM

reames mentioned this in rGd636bcb6ae51: [RISCV] Introduce unaligned-vector-mem feature.Apr 28 2023, 8:28 AM

asb mentioned this in D150717: [RISCV] Use scalar stores for splats of zero to memory up to XLen.May 17 2023, 4:14 AM

luke mentioned this in D150771: [RISCV] Set Fast flag for unaligned memory accesses.May 17 2023, 5:14 AM

luke mentioned this in rG5b95bba6fe41: [RISCV] Set Fast flag for unaligned memory accesses.Jul 24 2023, 2:59 AM

Revision Contents

Path

Size

llvm/

lib/

Target/

RISCV/

RISCV.td

5 lines

RISCVISelLowering.cpp

5 lines

RISCVSubtarget.h

2 lines

test/

CodeGen/

RISCV/

unaligned-load-store.ll

164 lines

Diff 431026

llvm/lib/Target/RISCV/RISCV.td

	Show First 20 Lines • Show All 424 Lines • ▼ Show 20 Lines
	foreach i = {1-31} in			foreach i = {1-31} in
	def FeatureReserveX#i :			def FeatureReserveX#i :
	SubtargetFeature<"reserve-x"#i, "UserReservedRegister[RISCV::X"#i#"]",			SubtargetFeature<"reserve-x"#i, "UserReservedRegister[RISCV::X"#i#"]",
	"true", "Reserve X"#i>;			"true", "Reserve X"#i>;

	def FeatureSaveRestore : SubtargetFeature<"save-restore", "EnableSaveRestore",			def FeatureSaveRestore : SubtargetFeature<"save-restore", "EnableSaveRestore",
	"true", "Enable save/restore.">;			"true", "Enable save/restore.">;

				def FeatureUnalignedMem
				: SubtargetFeature<"unaligned-mem", "EnableUnalignedMem",
				"true", "Implements unaligned scalar loads and stores">;
				asbUnsubmitted Not Done Reply Inline Actions Nit: reasonable => reasonably asb: Nit: reasonable => reasonably
				craig.topperUnsubmitted Not Done Reply Inline Actions asb's comment still needs to be addressed. craig.topper: asb's comment still needs to be addressed.
				reamesAuthorUnsubmitted Done Reply Inline Actions Oops, yep, will rev. reames: Oops, yep, will rev.


	def TuneNoDefaultUnroll			def TuneNoDefaultUnroll
	: SubtargetFeature<"no-default-unroll", "EnableDefaultUnroll", "false",			: SubtargetFeature<"no-default-unroll", "EnableDefaultUnroll", "false",
	"Disable default unroll preference.">;			"Disable default unroll preference.">;

	def TuneSiFive7 : SubtargetFeature<"sifive7", "RISCVProcFamily", "SiFive7",			def TuneSiFive7 : SubtargetFeature<"sifive7", "RISCVProcFamily", "SiFive7",
	"SiFive 7-Series processors">;			"SiFive 7-Series processors">;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	▲ Show 20 Lines • Show All 125 Lines • Show Last 20 Lines

llvm/lib/Target/RISCV/RISCVISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 11,801 Lines • ▼ Show 20 Lines	bool RISCVTargetLowering::isMulAddWithConstProfitable(SDValue AddNode,
// Default to true and let the DAGCombiner decide.		// Default to true and let the DAGCombiner decide.
return true;		return true;
}		}

bool RISCVTargetLowering::allowsMisalignedMemoryAccesses(		bool RISCVTargetLowering::allowsMisalignedMemoryAccesses(
EVT VT, unsigned AddrSpace, Align Alignment, MachineMemOperand::Flags Flags,		EVT VT, unsigned AddrSpace, Align Alignment, MachineMemOperand::Flags Flags,
bool *Fast) const {		bool *Fast) const {
if (!VT.isVector())		if (!VT.isVector())
return false;		return Subtarget.enableUnalignedMem();
		jrtc27Unsubmitted Not Done Reply Inline Actions If returning true and Fast isn't null we should be setting it; some users pre-initialise it, but not all (I found one in GlobalISel and stopped looking) jrtc27: If returning true and Fast isn't null we should be setting it; some users pre-initialise it…
		reamesAuthorUnsubmitted Done Reply Inline Actions Good catch. I will unconditionally initialize to false to make sure we don't miss any paths. Thanks! reames: Good catch. I will unconditionally initialize to false to make sure we don't miss any paths.

		// All vector implementations must support element alignment
EVT ElemVT = VT.getVectorElementType();		EVT ElemVT = VT.getVectorElementType();
if (Alignment >= ElemVT.getStoreSize()) {		if (Alignment >= ElemVT.getStoreSize()) {
if (Fast)		if (Fast)
*Fast = true;		*Fast = true;
		craig.topperUnsubmitted Not Done Reply Inline Actions Not directly related to this patch, but it could be argued that Fast = true is incorrect for vectors here. Being element aligned doesn't guarantee optimal access if you're splitting a cache line or other alignment requirements of your memory unit. craig.topper:* Not directly related to this patch, but it could be argued that *Fast = true is incorrect for…
return true;		return true;
}		}

return false;		return Subtarget.enableUnalignedMem();
		craig.topperUnsubmitted Not Done Reply Inline Actions This seems like it should be a different feature for vectors. craig.topper: This seems like it should be a different feature for vectors.
		reamesAuthorUnsubmitted Done Reply Inline Actions There might be such a target where scalar vs vector matters, but on the motivating case, this is not expected to matter. We can split later if needed. reames: There might be such a target where scalar vs vector matters, but on the motivating case, this…
		craig.topperUnsubmitted Not Done Reply Inline Actions There are no vector tests and the description in the .td file says “scalar”. The patch should at least be self consistent. craig.topper: There are no vector tests and the description in the .td file says “scalar”. The patch should…
		reamesAuthorUnsubmitted Done Reply Inline Actions You're absolutely correct on that. I'd originally had it split, then changed my mind. Let me rebase and correct both of those! reames: You're absolutely correct on that. I'd originally had it split, then changed my mind. Let me…
		reamesAuthorUnsubmitted Not Done Reply Inline Actions It turns out writing vector tests was a useful exercise. :) From what I can tell, we aggressively canonicalize towards byte element loads and stores for vectors. As such, no plain vector load or store is every treated as unaligned. However, some indexed and strided load tests did fail if I asserted the fallthrough return was never taken. For the moment, I decided to restrict the scope to only the scalar case. Mostly because there look to be enough other opportunities around indexed loads that this didn't seem terribly useful. With that, I could go in two directions here. Rename this to scalar and add a separate flag later for vector cases as warranted. Document the intent that this covers all access, but leave the indexed/strided case currently unimplemented. I have no strong preference and will do as reviewers request. reames: It turns out writing vector tests was a useful exercise. :) From what I can tell, we…
		reamesAuthorUnsubmitted Done Reply Inline Actions @craig.topper ping? reames: @craig.topper ping?
		craig.topperUnsubmitted Not Done Reply Inline Actions Let's rename it to scalar and add a separate flag for vector later as warranted. craig.topper: Let's rename it to scalar and add a separate flag for vector later as warranted.
}		}

bool RISCVTargetLowering::splitValueIntoRegisterParts(		bool RISCVTargetLowering::splitValueIntoRegisterParts(
SelectionDAG &DAG, const SDLoc &DL, SDValue Val, SDValue *Parts,		SelectionDAG &DAG, const SDLoc &DL, SDValue Val, SDValue *Parts,
unsigned NumParts, MVT PartVT, Optional<CallingConv::ID> CC) const {		unsigned NumParts, MVT PartVT, Optional<CallingConv::ID> CC) const {
bool IsABIRegCopy = CC.hasValue();		bool IsABIRegCopy = CC.hasValue();
EVT ValueVT = Val.getValueType();		EVT ValueVT = Val.getValueType();
if (IsABIRegCopy && ValueVT == MVT::f16 && PartVT == MVT::f32) {		if (IsABIRegCopy && ValueVT == MVT::f16 && PartVT == MVT::f32) {
▲ Show 20 Lines • Show All 175 Lines • Show Last 20 Lines

llvm/lib/Target/RISCV/RISCVSubtarget.h

Show First 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	private:
bool HasStdExtZkt = false;		bool HasStdExtZkt = false;
bool HasStdExtZk = false;		bool HasStdExtZk = false;
bool HasRV64 = false;		bool HasRV64 = false;
bool IsRV32E = false;		bool IsRV32E = false;
bool EnableLinkerRelax = false;		bool EnableLinkerRelax = false;
bool EnableRVCHintInstrs = true;		bool EnableRVCHintInstrs = true;
bool EnableDefaultUnroll = true;		bool EnableDefaultUnroll = true;
bool EnableSaveRestore = false;		bool EnableSaveRestore = false;
		bool EnableUnalignedMem = false;
unsigned XLen = 32;		unsigned XLen = 32;
unsigned ZvlLen = 0;		unsigned ZvlLen = 0;
MVT XLenVT = MVT::i32;		MVT XLenVT = MVT::i32;
uint8_t MaxInterleaveFactor = 2;		uint8_t MaxInterleaveFactor = 2;
RISCVABI::ABI TargetABI = RISCVABI::ABI_Unknown;		RISCVABI::ABI TargetABI = RISCVABI::ABI_Unknown;
BitVector UserReservedRegister;		BitVector UserReservedRegister;
RISCVFrameLowering FrameLowering;		RISCVFrameLowering FrameLowering;
RISCVInstrInfo InstrInfo;		RISCVInstrInfo InstrInfo;
▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines	public:
bool hasStdExtZksh() const { return HasStdExtZksh; }		bool hasStdExtZksh() const { return HasStdExtZksh; }
bool hasStdExtZkr() const { return HasStdExtZkr; }		bool hasStdExtZkr() const { return HasStdExtZkr; }
bool is64Bit() const { return HasRV64; }		bool is64Bit() const { return HasRV64; }
bool isRV32E() const { return IsRV32E; }		bool isRV32E() const { return IsRV32E; }
bool enableLinkerRelax() const { return EnableLinkerRelax; }		bool enableLinkerRelax() const { return EnableLinkerRelax; }
bool enableRVCHintInstrs() const { return EnableRVCHintInstrs; }		bool enableRVCHintInstrs() const { return EnableRVCHintInstrs; }
bool enableDefaultUnroll() const { return EnableDefaultUnroll; }		bool enableDefaultUnroll() const { return EnableDefaultUnroll; }
bool enableSaveRestore() const { return EnableSaveRestore; }		bool enableSaveRestore() const { return EnableSaveRestore; }
		bool enableUnalignedMem() const { return EnableUnalignedMem; }
MVT getXLenVT() const { return XLenVT; }		MVT getXLenVT() const { return XLenVT; }
unsigned getXLen() const { return XLen; }		unsigned getXLen() const { return XLen; }
unsigned getFLen() const {		unsigned getFLen() const {
if (HasStdExtD)		if (HasStdExtD)
return 64;		return 64;

if (HasStdExtF)		if (HasStdExtF)
return 32;		return 32;
▲ Show 20 Lines • Show All 69 Lines • Show Last 20 Lines

llvm/test/CodeGen/RISCV/unaligned-load-store.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc -mtriple=riscv32 -verify-machineinstrs < %s \			; RUN: llc -mtriple=riscv32 -verify-machineinstrs < %s \
	; RUN: \| FileCheck -check-prefixes=BOTH,RV32I %s			; RUN: \| FileCheck -check-prefixes=ALL,NOALIGN,RV32I %s
	; RUN: llc -mtriple=riscv64 -verify-machineinstrs < %s \			; RUN: llc -mtriple=riscv64 -verify-machineinstrs < %s \
	; RUN: \| FileCheck -check-prefixes=BOTH,RV64I %s			; RUN: \| FileCheck -check-prefixes=ALL,NOALIGN,RV64I %s
				; RUN: llc -mtriple=riscv32 -mattr=+unaligned-mem -verify-machineinstrs < %s \
				; RUN: \| FileCheck -check-prefixes=ALL,ALIGN,ALIGN-RV32I %s
				; RUN: llc -mtriple=riscv64 -mattr=+unaligned-mem -verify-machineinstrs < %s \
				; RUN: \| FileCheck -check-prefixes=ALL,ALIGN,ALIGN-RV64I %s

	; A collection of cases showing codegen for unaligned loads and stores			; A collection of cases showing codegen for unaligned loads and stores

	define i8 @load_i8(i8* %p) {			define i8 @load_i8(i8* %p) {
	; BOTH-LABEL: load_i8:			; ALL-LABEL: load_i8:
	; BOTH: # %bb.0:			; ALL: # %bb.0:
	; BOTH-NEXT: lb a0, 0(a0)			; ALL-NEXT: lb a0, 0(a0)
	; BOTH-NEXT: ret			; ALL-NEXT: ret
	%res = load i8, i8* %p, align 1			%res = load i8, i8* %p, align 1
	ret i8 %res			ret i8 %res
	}			}

	define i16 @load_i16(i16* %p) {			define i16 @load_i16(i16* %p) {
	; BOTH-LABEL: load_i16:			; NOALIGN-LABEL: load_i16:
	; BOTH: # %bb.0:			; NOALIGN: # %bb.0:
	; BOTH-NEXT: lb a1, 1(a0)			; NOALIGN-NEXT: lb a1, 1(a0)
	; BOTH-NEXT: lbu a0, 0(a0)			; NOALIGN-NEXT: lbu a0, 0(a0)
	; BOTH-NEXT: slli a1, a1, 8			; NOALIGN-NEXT: slli a1, a1, 8
	; BOTH-NEXT: or a0, a1, a0			; NOALIGN-NEXT: or a0, a1, a0
	; BOTH-NEXT: ret			; NOALIGN-NEXT: ret
				;
				; ALIGN-LABEL: load_i16:
				; ALIGN: # %bb.0:
				; ALIGN-NEXT: lh a0, 0(a0)
				; ALIGN-NEXT: ret
	%res = load i16, i16* %p, align 1			%res = load i16, i16* %p, align 1
	ret i16 %res			ret i16 %res
	}			}

	define i24 @load_i24(i24* %p) {			define i24 @load_i24(i24* %p) {
	; BOTH-LABEL: load_i24:			; NOALIGN-LABEL: load_i24:
	; BOTH: # %bb.0:			; NOALIGN: # %bb.0:
	; BOTH-NEXT: lbu a1, 1(a0)			; NOALIGN-NEXT: lbu a1, 1(a0)
	; BOTH-NEXT: lbu a2, 0(a0)			; NOALIGN-NEXT: lbu a2, 0(a0)
	; BOTH-NEXT: lb a0, 2(a0)			; NOALIGN-NEXT: lb a0, 2(a0)
	; BOTH-NEXT: slli a1, a1, 8			; NOALIGN-NEXT: slli a1, a1, 8
	; BOTH-NEXT: or a1, a1, a2			; NOALIGN-NEXT: or a1, a1, a2
	; BOTH-NEXT: slli a0, a0, 16			; NOALIGN-NEXT: slli a0, a0, 16
	; BOTH-NEXT: or a0, a1, a0			; NOALIGN-NEXT: or a0, a1, a0
	; BOTH-NEXT: ret			; NOALIGN-NEXT: ret
				;
				; ALIGN-LABEL: load_i24:
				; ALIGN: # %bb.0:
				; ALIGN-NEXT: lb a1, 2(a0)
				; ALIGN-NEXT: lhu a0, 0(a0)
				; ALIGN-NEXT: slli a1, a1, 16
				; ALIGN-NEXT: or a0, a0, a1
				; ALIGN-NEXT: ret
	%res = load i24, i24* %p, align 1			%res = load i24, i24* %p, align 1
	ret i24 %res			ret i24 %res
	}			}

	define i32 @load_i32(i32* %p) {			define i32 @load_i32(i32* %p) {
	; RV32I-LABEL: load_i32:			; RV32I-LABEL: load_i32:
	; RV32I: # %bb.0:			; RV32I: # %bb.0:
	; RV32I-NEXT: lbu a1, 1(a0)			; RV32I-NEXT: lbu a1, 1(a0)
	Show All 16 Lines
	; RV64I-NEXT: lbu a0, 2(a0)			; RV64I-NEXT: lbu a0, 2(a0)
	; RV64I-NEXT: slli a1, a1, 8			; RV64I-NEXT: slli a1, a1, 8
	; RV64I-NEXT: or a1, a1, a2			; RV64I-NEXT: or a1, a1, a2
	; RV64I-NEXT: slli a2, a3, 8			; RV64I-NEXT: slli a2, a3, 8
	; RV64I-NEXT: or a0, a2, a0			; RV64I-NEXT: or a0, a2, a0
	; RV64I-NEXT: slli a0, a0, 16			; RV64I-NEXT: slli a0, a0, 16
	; RV64I-NEXT: or a0, a0, a1			; RV64I-NEXT: or a0, a0, a1
	; RV64I-NEXT: ret			; RV64I-NEXT: ret
				;
				; ALIGN-LABEL: load_i32:
				; ALIGN: # %bb.0:
				; ALIGN-NEXT: lw a0, 0(a0)
				; ALIGN-NEXT: ret
	%res = load i32, i32* %p, align 1			%res = load i32, i32* %p, align 1
	ret i32 %res			ret i32 %res
	}			}

	define i64 @load_i64(i64* %p) {			define i64 @load_i64(i64* %p) {
	; RV32I-LABEL: load_i64:			; RV32I-LABEL: load_i64:
	; RV32I: # %bb.0:			; RV32I: # %bb.0:
	; RV32I-NEXT: lbu a1, 1(a0)			; RV32I-NEXT: lbu a1, 1(a0)
	Show All 39 Lines
	; RV64I-NEXT: or a2, a2, a3			; RV64I-NEXT: or a2, a2, a3
	; RV64I-NEXT: slli a3, a4, 8			; RV64I-NEXT: slli a3, a4, 8
	; RV64I-NEXT: or a0, a3, a0			; RV64I-NEXT: or a0, a3, a0
	; RV64I-NEXT: slli a0, a0, 16			; RV64I-NEXT: slli a0, a0, 16
	; RV64I-NEXT: or a0, a0, a2			; RV64I-NEXT: or a0, a0, a2
	; RV64I-NEXT: slli a0, a0, 32			; RV64I-NEXT: slli a0, a0, 32
	; RV64I-NEXT: or a0, a0, a1			; RV64I-NEXT: or a0, a0, a1
	; RV64I-NEXT: ret			; RV64I-NEXT: ret
				;
				; ALIGN-RV32I-LABEL: load_i64:
				; ALIGN-RV32I: # %bb.0:
				; ALIGN-RV32I-NEXT: lw a2, 0(a0)
				; ALIGN-RV32I-NEXT: lw a1, 4(a0)
				; ALIGN-RV32I-NEXT: mv a0, a2
				; ALIGN-RV32I-NEXT: ret
				;
				; ALIGN-RV64I-LABEL: load_i64:
				; ALIGN-RV64I: # %bb.0:
				; ALIGN-RV64I-NEXT: ld a0, 0(a0)
				; ALIGN-RV64I-NEXT: ret
	%res = load i64, i64* %p, align 1			%res = load i64, i64* %p, align 1
	ret i64 %res			ret i64 %res
	}			}

	define void @store_i8(i8* %p, i8 %v) {			define void @store_i8(i8* %p, i8 %v) {
	; BOTH-LABEL: store_i8:			; ALL-LABEL: store_i8:
	; BOTH: # %bb.0:			; ALL: # %bb.0:
	; BOTH-NEXT: sb a1, 0(a0)			; ALL-NEXT: sb a1, 0(a0)
	; BOTH-NEXT: ret			; ALL-NEXT: ret
	store i8 %v, i8* %p, align 1			store i8 %v, i8* %p, align 1
	ret void			ret void
	}			}

	define void @store_i16(i16* %p, i16 %v) {			define void @store_i16(i16* %p, i16 %v) {
	; BOTH-LABEL: store_i16:			; NOALIGN-LABEL: store_i16:
	; BOTH: # %bb.0:			; NOALIGN: # %bb.0:
	; BOTH-NEXT: sb a1, 0(a0)			; NOALIGN-NEXT: sb a1, 0(a0)
	; BOTH-NEXT: srli a1, a1, 8			; NOALIGN-NEXT: srli a1, a1, 8
	; BOTH-NEXT: sb a1, 1(a0)			; NOALIGN-NEXT: sb a1, 1(a0)
	; BOTH-NEXT: ret			; NOALIGN-NEXT: ret
				;
				; ALIGN-LABEL: store_i16:
				; ALIGN: # %bb.0:
				; ALIGN-NEXT: sh a1, 0(a0)
				; ALIGN-NEXT: ret
	store i16 %v, i16* %p, align 1			store i16 %v, i16* %p, align 1
	ret void			ret void
	}			}

	define void @store_i24(i24* %p, i24 %v) {			define void @store_i24(i24* %p, i24 %v) {
	; BOTH-LABEL: store_i24:			; NOALIGN-LABEL: store_i24:
	; BOTH: # %bb.0:			; NOALIGN: # %bb.0:
	; BOTH-NEXT: sb a1, 0(a0)			; NOALIGN-NEXT: sb a1, 0(a0)
	; BOTH-NEXT: srli a2, a1, 8			; NOALIGN-NEXT: srli a2, a1, 8
	; BOTH-NEXT: sb a2, 1(a0)			; NOALIGN-NEXT: sb a2, 1(a0)
	; BOTH-NEXT: srli a1, a1, 16			; NOALIGN-NEXT: srli a1, a1, 16
	; BOTH-NEXT: sb a1, 2(a0)			; NOALIGN-NEXT: sb a1, 2(a0)
	; BOTH-NEXT: ret			; NOALIGN-NEXT: ret
				;
				; ALIGN-LABEL: store_i24:
				; ALIGN: # %bb.0:
				; ALIGN-NEXT: sh a1, 0(a0)
				; ALIGN-NEXT: srli a1, a1, 16
				; ALIGN-NEXT: sb a1, 2(a0)
				; ALIGN-NEXT: ret
	store i24 %v, i24* %p, align 1			store i24 %v, i24* %p, align 1
	ret void			ret void
	}			}

	define void @store_i32(i32* %p, i32 %v) {			define void @store_i32(i32* %p, i32 %v) {
	; BOTH-LABEL: store_i32:			; NOALIGN-LABEL: store_i32:
	; BOTH: # %bb.0:			; NOALIGN: # %bb.0:
	; BOTH-NEXT: sb a1, 0(a0)			; NOALIGN-NEXT: sb a1, 0(a0)
	; BOTH-NEXT: srli a2, a1, 24			; NOALIGN-NEXT: srli a2, a1, 24
	; BOTH-NEXT: sb a2, 3(a0)			; NOALIGN-NEXT: sb a2, 3(a0)
	; BOTH-NEXT: srli a2, a1, 16			; NOALIGN-NEXT: srli a2, a1, 16
	; BOTH-NEXT: sb a2, 2(a0)			; NOALIGN-NEXT: sb a2, 2(a0)
	; BOTH-NEXT: srli a1, a1, 8			; NOALIGN-NEXT: srli a1, a1, 8
	; BOTH-NEXT: sb a1, 1(a0)			; NOALIGN-NEXT: sb a1, 1(a0)
	; BOTH-NEXT: ret			; NOALIGN-NEXT: ret
				;
				; ALIGN-LABEL: store_i32:
				; ALIGN: # %bb.0:
				; ALIGN-NEXT: sw a1, 0(a0)
				; ALIGN-NEXT: ret
	store i32 %v, i32* %p, align 1			store i32 %v, i32* %p, align 1
	ret void			ret void
	}			}

	define void @store_i64(i64* %p, i64 %v) {			define void @store_i64(i64* %p, i64 %v) {
	; RV32I-LABEL: store_i64:			; RV32I-LABEL: store_i64:
	; RV32I: # %bb.0:			; RV32I: # %bb.0:
	; RV32I-NEXT: sb a2, 4(a0)			; RV32I-NEXT: sb a2, 4(a0)
	Show All 25 Lines
	; RV64I-NEXT: sb a2, 4(a0)			; RV64I-NEXT: sb a2, 4(a0)
	; RV64I-NEXT: srli a2, a1, 24			; RV64I-NEXT: srli a2, a1, 24
	; RV64I-NEXT: sb a2, 3(a0)			; RV64I-NEXT: sb a2, 3(a0)
	; RV64I-NEXT: srli a2, a1, 16			; RV64I-NEXT: srli a2, a1, 16
	; RV64I-NEXT: sb a2, 2(a0)			; RV64I-NEXT: sb a2, 2(a0)
	; RV64I-NEXT: srli a1, a1, 8			; RV64I-NEXT: srli a1, a1, 8
	; RV64I-NEXT: sb a1, 1(a0)			; RV64I-NEXT: sb a1, 1(a0)
	; RV64I-NEXT: ret			; RV64I-NEXT: ret
				;
				; ALIGN-RV32I-LABEL: store_i64:
				; ALIGN-RV32I: # %bb.0:
				; ALIGN-RV32I-NEXT: sw a2, 4(a0)
				; ALIGN-RV32I-NEXT: sw a1, 0(a0)
				; ALIGN-RV32I-NEXT: ret
				;
				; ALIGN-RV64I-LABEL: store_i64:
				; ALIGN-RV64I: # %bb.0:
				; ALIGN-RV64I-NEXT: sd a1, 0(a0)
				; ALIGN-RV64I-NEXT: ret
	store i64 %v, i64* %p, align 1			store i64 %v, i64* %p, align 1
	ret void			ret void
	}			}