This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Driver/
-
Driver/
-
SanitizerArgs.cpp
-
ToolChain.cpp
-
test/
-
CodeGen/
1/1
shadowcallstack-attr.c
-
Driver/
-
sanitizer-ld.c
-
llvm/
-
lib/Target/RISCV/
-
Target/
-
RISCV/
20/22
RISCVFrameLowering.cpp
-
test/CodeGen/RISCV/
-
CodeGen/
-
RISCV/
6/6
shadowcallstack.ll

Differential D84414

[RISCV] Support Shadow Call Stack
ClosedPublic

Authored by zzheng on Jul 23 2020, 7:47 AM.

Download Raw Diff

Details

Reviewers

apazos
lenary
asb
jrtc27

Commits

rG1c466477ad46: [RISCV] Support Shadow Call Stack

Summary

[RISCV] Support Shadow Call Stack

Currenlty assume x18 is used as pointer to shadow call stack. User shall pass
flags:

"-fsanitize=shadow-call-stack -ffixed-x18"

Runtime supported is needed to setup x18.

If SCS is desired, all parts of the program should be built with -ffixed-x18 to
maintain inter-operatability.

There's no particuluar reason that we must use x18 as SCS pointer. Any register
may be used, as long as it does not have designated purpose already, like RA or
passing call arguments.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

zzheng created this revision.Jul 23 2020, 7:47 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJul 23 2020, 7:47 AM

Herald added subscribers: cfe-commits, aaron.ballman, evandro and 26 others. · View Herald Transcript

HsiangKai added a subscriber: HsiangKai.Jul 23 2020, 7:58 AM

aaron.ballman added inline comments.Jul 23 2020, 7:58 AM

clang/test/CodeGen/shadowcallstack-attr.c
22	Now might be a good opportunity to update this check prefix to a less loaded term.

jrtc27 requested changes to this revision.Jul 23 2020, 8:01 AM

jrtc27 added inline comments.

llvm/test/CodeGen/RISCV/shadowcallstack.ll
2–4	Please style these in the same way as the other RISC-V CodeGen tests, in terms of argument order, redirecting stdin rather than using `-o - %s`, and using riscv64 rather than riscv64-linux-gnu (unless needed?). Also use update_llc_test_checks.py rather than hand-writing this. And we generally use RV32I and RV64I (or other appropriate arch strings) instead of RISCV32 and RISCV64 prefixes.

This revision now requires changes to proceed.Jul 23 2020, 8:01 AM

Is there a reason for choosing X18? On AArch64 that's either a temporary or saved register depending on ABI, but determined to be the "platform register". Picking X18 on RISC-V because that's the same index as AArch64 seems a little arbitrary, but maybe it happens to make sense.

Harbormaster failed remote builds in B65374: Diff 280117!Jul 23 2020, 8:37 AM

apazos added inline comments.Jul 23 2020, 9:05 AM

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
95	There are thee things to observe here and other reviewers might have some additional comments: RISC-V does not have a reserved platform register like AAch64. The patch uses one of the RISC-V callee saved registers, x18, which happens to coincide with AArch64's register. It is possible to select another register, and additional checks for the flag combo "-fsanitize=shadow-call-stack -ffixed-xxxx" will have to be added. The return address is saved on both the SCS (whose location is protected/hidden) and also in the regular stack. But the return from a function uses the value saved on SCS. The understanding is that not saving it in the regular stack can impact debugging. The SCS is ascending, while the regular stack, by RISC-V convention, is descending. The SCS is not used for passing parameters between calls like the regular stack, so it seems to be ok. But this can be changed too. AArch64 's SCS is also ascending.

Using 'BLOCKED' now.

clang-formated RISCVFrameLowering.cpp

updated style of test/CodeGen/RISCV/shadowcallstack.ll

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
95	Thanks for the clarification, Ana.

Harbormaster completed remote builds in B65430: Diff 280210.Jul 23 2020, 12:27 PM

Please use local variables with meaningful names for RISCV::Xn rather than inlining them everywhere and making it harder at a glance to work out what's going on without knowing what the ABI register names are.

Please make a RISCVABI::getSCSPReg or similar function to avoid hard-coding X18 in a bunch of places.

I'm not convinced a callee-saved register makes any more sense than anything else. Whatever you pick, compatibility only works one way round. If foo uses an SCS and calls bar that doesn't, then being callee saved helps you, but if bar calls foo then (a) foo will try to dereference SCSP (to store then later load), likely faulting, perhaps instead "just" clobbering arbitrary memory (b) if foo manages to return back to bar without faulting then bar would expect the register to have been saved, but it hasn't, an ABI violation. If you use a caller-saved register instead then bar can call foo just fine, but foo can't call bar as it'll clobber its SCSP. Reserving any register like this is a fundamental ABI incompatibility, so picking x18 because it's callee-saved is meaningless, and picking it to avoid duplicating 6 lines of code (or fewer, if the existing 6 lines are refactored) isn't a good reason either. It's ultimately arbitrary, but it should be at least be based on some kind of valid reasoning if such reasoning exists. X18 may or may not be a right answer.

(One might initially think that this incompatibility is fine if you're building an executable with SCSP that calls other libraries without SCSP, but that breaks down as soon as callbacks exist, as well as more niche things like symbol preemption.)

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
33–37	if (find(CSI, RISCV::X1) == CSI.end()) return; (using `llvm::find` as a convenient wrapper around `std::find`, ie shorthand for `std::find(CSI.begin(), CSI.end(), RISCV::X1)`). Though personally I'd prefer to see X1 come from `RI.getRARegister()` rather than be hard-coded; other functions in this file already hard-code it, but in our CHERI fork we need to replace RISCV::X1 with RISCV::C1 everywhere so have changed those. Having said that, CHERI renders a shadow call stack unnecessary, so I don't particularly care if it's broken there, personally. But I still think it's nicer code.
54	This is wrong for RV64.
69–73	As above.
94	Also wrong for RV64.

This revision now requires changes to proceed.Jul 23 2020, 12:37 PM

MaskRay added inline comments.Jul 23 2020, 12:42 PM

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
33–37	`!llvm::is_contained(CSI, RISCV::X1)`

pcc added a subscriber: pcc.Jul 23 2020, 12:44 PM

pcc added inline comments.

llvm/test/CodeGen/RISCV/shadowcallstack.ll
12	Shouldn't it be looking for `s2` since that's how `x18` is spelled in assembly?

jrtc27 added inline comments.Jul 23 2020, 12:48 PM

llvm/test/CodeGen/RISCV/shadowcallstack.ll
2	As I said before, please just use `-mtriple=riscv32`. The `-unknown-elf` is implied, irrelevant and wastes space, so all the OS-independent CodeGen tests just specify the CPU.
3	Two extra spaces to indent the \| is the predominant style.
4	Delete this blank line.
12	The -NOTs shouldn't even exist, this isn't how you use `update_llc_test_checks.py`. But yes, by default that's how it'll be printed unless you disable printing aliases.

Any callee-saved register can be used.

In fact, any register may be used, as long as it cannot be used to pass arguments or return values. It may be better to select a temporary register, as x18 is on aarch64 when not being used as a platform register, so that problems are noticed more easily (i.e. more likely to be clobbered by incompatible code).

Addressed styling & code clarity issues.

Fixed wrong opcode for RV64.

Unlike x18 on AArch64, there's no register that should normally be reserved/not-used on RISCV. So using any eligible register would break ABI compatibility. x18 is neither a better nor a worse choice than other registers. Non-SCS code should be built with -ffixed-x18 to be compatible with SCS-enabled code.

zzheng marked 2 inline comments as done.Jul 27 2020, 1:24 PM

zzheng added inline comments.

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
33–37	Not sure how to make llvm::find or llvm::is_contained work in this scenario. CSI is a std::vector<llvm::CalleeSavedInfo>. We need a getReg() for each element in it before comparing to a 'Register'

clang-formatted.

Harbormaster failed remote builds in B65914: Diff 281038!Jul 27 2020, 4:08 PM

Harbormaster failed remote builds in B65927: Diff 281062!Jul 27 2020, 5:31 PM

rebased..

Harbormaster completed remote builds in B66107: Diff 281377.Jul 28 2020, 5:40 PM

There is a possibly-compelling argument against using x18: RV32E only gives x0-x15, so would not be able to support the current implementation.

In D84414#2186257, @jrtc27 wrote:

There is a possibly-compelling argument against using x18: RV32E only gives x0-x15, so would not be able to support the current implementation.

We discussed this on the RISC-V LLVM Sync-up (both this time and on 6 August 2020 IIRC). The consensus view is: if you're on an rv32e implementation, you're potentially too constrained to use a shadow call stack anyway. Even then, LLVM doesn't implement the rest of rv32e yet (though there are plans to do so at some point, which means we can revisit this change).

Our feeling is that if users have a distinct need to use a different register at the moment, they can use a downstream change to LLVM. The fact that we use RISCVABI::getSCSPReg() should make this fairly easy.

If the register choice is the only concern about this work, then I think we can probably land it as-is and fixup the register choice if we see major drawbacks later. Yes, it's an ABI issue, but on the other hand the shadow call stack is not a standard ABI anyway.

Rebased & ping...

IMHO, the patch is in good shape. As we discussed in the bi-weekly meetings, RV32E only has 16 registers. Systems based on RV32E may have limited memory as well. Besides, LLVM does not have full support for RV32E yet. We can commit this patch as-is and change it later if RV32E needs SCS.

Why do we have to pass -ffixed-x18 when compiling? Is it enough to just reserve x18 whenever the function has the shadow call stack attribute?

Harbormaster completed remote builds in B69340: Diff 287428.Aug 24 2020, 10:41 AM

In D84414#2234112, @lenary wrote:

Why do we have to pass -ffixed-x18 when compiling? Is it enough to just reserve x18 whenever the function has the shadow call stack attribute?

When SCS is on, x18 must be preserved across calls. Given it's a callee-saved, value in x18 is preserved by functions that do not have SCS attribute.

However, saving x18 on stack risks leaking SCS's location in memory, making the defense useless.

In D84414#2234255, @zzheng wrote:

In D84414#2234112, @lenary wrote:

Why do we have to pass -ffixed-x18 when compiling? Is it enough to just reserve x18 whenever the function has the shadow call stack attribute?

When SCS is on, x18 must be preserved across calls. Given it's a callee-saved, value in x18 is preserved by functions that do not have SCS attribute.

However, saving x18 on stack risks leaking SCS's location in memory, making the defense useless.

Ok, so any compilation units without -fsanitize=shadow-call-stack should be compiled with -ffixed-x18 if you want to link those together. That is reasonable. My question was whether we can ensure that -fsanitize=shadow-call-stack can imply -ffixed-x18 rather than having to pass both options.

It is my understanding that all functions in a CU with -fsanitize=shadow-call-stack will get the SCS function attribute, so why can't we use that attribute to work out whether x18 should be reserved, rather than using -ffixed-x18? You'll see RISCVRegisterInfo::getReservedRegs reserves fp and bp only if they're needed by the function (which can be based on attributes or other info), rather than using isRegisterReservedByUser - which seems cleaner to me.

FWIW, on aarch64 we decided to make -fsanitize=shadow-call-stack require the x18 reservation (instead of implying it) to try to avoid ABI mismatch problems. That is, it should be safe to mix and match code compiled with and without -fsanitize=shadow-call-stack. If we make -fsanitize=shadow-call-stack imply the x18 reservation, it makes it more likely that someone will accidentally build and link in incompatible code that does not reserve x18.

In D84414#2234327, @pcc wrote:

FWIW, on aarch64 we decided to make -fsanitize=shadow-call-stack require the x18 reservation (instead of implying it) to try to avoid ABI mismatch problems. That is, it should be safe to mix and match code compiled with and without -fsanitize=shadow-call-stack. If we make -fsanitize=shadow-call-stack imply the x18 reservation, it makes it more likely that someone will accidentally build and link in incompatible code that does not reserve x18.

Ok, that approach does also make sense.

In D84414#2234267, @lenary wrote:

Ok, so any compilation units without -fsanitize=shadow-call-stack should be compiled with -ffixed-x18 if you want to link those together. That is reasonable. My question was whether we can ensure that -fsanitize=shadow-call-stack can imply -ffixed-x18 rather than having to pass both options.

It is my understanding that all functions in a CU with -fsanitize=shadow-call-stack will get the SCS function attribute, so why can't we use that attribute to work out whether x18 should be reserved, rather than using -ffixed-x18? You'll see RISCVRegisterInfo::getReservedRegs reserves fp and bp only if they're needed by the function (which can be based on attributes or other info), rather than using isRegisterReservedByUser - which seems cleaner to me.

In D84414#2234327, @pcc wrote:

FWIW, on aarch64 we decided to make -fsanitize=shadow-call-stack require the x18 reservation (instead of implying it) to try to avoid ABI mismatch problems. That is, it should be safe to mix and match code compiled with and without -fsanitize=shadow-call-stack. If we make -fsanitize=shadow-call-stack imply the x18 reservation, it makes it more likely that someone will accidentally build and link in incompatible code that does not reserve x18.

Yes, binding -fsanitize=shadow-call-stack with -ffixed-x18 is to ensure user don't accidentally link SCS-enabled code with non-SCS code that uses/overwrites x18. We can always infer that x18 is reserved when SCS in on.

There's no need to check RISCVRegisterInfo::getReservedRegs for x18 by looking into function attr. It forms a circular chain: is x18 reserved for SCS? <--> SCS is enabled, so x18 should be reserved.

ping..

@jrt27, @lenary, @asb, IMHO, the patch is in good shape now, all concerns raised in comments has been addressed/answered, is there any additional comments before we can land it?

rebase & ping..

This is currently incompatible with the save/restore libcalls, and I don't think there's any way to avoid that (the restore libcall both loads ra and jumps to it). We should ensure combining them gives an error.

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
36–37
43	Pointless comment; remove
46	Pointless comment; remove
53	This should be passed in as an argument IMO (same for the epilogue) given the standard prologue/epilogue code already has a DebugLoc lying around.
59–60
60	Is it intended that the shadow call stack grows up unlike the normal stack?
79–80	No need to repeat ourselves.
102–103	Although in fact you have both a bug and a minor performance issue with this, and it should be: // l[w\|d] ra, [-4\|-8](s2) // addi s2, s2, -[4\|8] Then there's no read-after-write dependency chain, which is better for out-of-order cores. The bug is that, currently, you risk a signal handler clobbering your SCS slot in between the two instructions, since you deallocate the frame before you read from it. Will be rare in practice, but a possibility.

This revision now requires changes to proceed.Sep 16 2020, 11:20 AM

jrtc27 added inline comments.Sep 16 2020, 11:21 AM

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
86	Pointless comment; remove
89	Pointless comment; remove

Harbormaster completed remote builds in B71904: Diff 292274.Sep 16 2020, 11:49 AM

Addressed comments by @jrtc27

zzheng marked 2 inline comments as done.Sep 16 2020, 4:25 PM

zzheng added inline comments.

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
60	No. Which direction the SCS grows on is trivial. The memory area hosting SCS is independent of the regular stack; and it's provided by the runtime. mmap/malloc returns the low address of newly mapped/allocated area. Making the SCS growing down requires the runtime to return upper bound of the SCS. On AArch64, the SCS grows up as well.

jrtc27 added inline comments.Sep 16 2020, 4:29 PM

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp
38
60	Ok. Wasn't saying there was anything wrong with it, was just something that jumped out at me. Having it grow up makes more sense (the only real advantage to the normal stack growing down these days is that doing aligned allocations is slightly cheaper).
79–80	This still holds.

Harbormaster completed remote builds in B71945: Diff 292372.Sep 16 2020, 5:03 PM

Fixed comment and lint

Harbormaster completed remote builds in B71952: Diff 292378.Sep 16 2020, 6:03 PM

I think once @jrtc27 confirms all her issues are addressed this is good to land.

Yes I think everything's been addressed now (though if I keep looking over it I might start nit-picking even more :)).

This revision is now accepted and ready to land.Sep 17 2020, 5:51 AM

This revision was landed with ongoing or failed builds.Sep 17 2020, 4:02 PM

Closed by commit rG1c466477ad46: [RISCV] Support Shadow Call Stack (authored by zzheng). · Explain Why

This revision was automatically updated to reflect the committed changes.

zzheng added a commit: rG1c466477ad46: [RISCV] Support Shadow Call Stack.

paulkirth mentioned this in D146463: [CodeGen][RISCV] Change Shadow Call Stack Register to X3.Mar 20 2023, 3:08 PM

paulkirth mentioned this in rGaa1d2693c256: [CodeGen][RISCV] Change Shadow Call Stack Register to X3.Apr 12 2023, 2:06 PM

paulkirth mentioned this in D149099: [RISCV] Make SCS prologue interrupt safe on RISC-V.Apr 24 2023, 2:20 PM

paulkirth mentioned this in rGbface3947ea1: [RISCV] Make SCS prologue interrupt safe on RISC-V.Apr 26 2023, 8:58 AM

Revision Contents

Path

Size

clang/

lib/

Driver/

SanitizerArgs.cpp

6 lines

ToolChain.cpp

3 lines

test/

CodeGen/

shadowcallstack-attr.c

24 lines

Driver/

sanitizer-ld.c

10 lines

llvm/

lib/

Target/

RISCV/

RISCVFrameLowering.cpp

80 lines

test/

CodeGen/

RISCV/

shadowcallstack.ll

179 lines

Diff 280210

clang/lib/Driver/SanitizerArgs.cpp

Show First 20 Lines • Show All 485 Lines • ▼ Show 20 Lines	SanitizerArgs::SanitizerArgs(const ToolChain &TC,
}		}

// Check that LTO is enabled if we need it.		// Check that LTO is enabled if we need it.
if ((Kinds & NeedsLTO) && !D.isUsingLTO()) {		if ((Kinds & NeedsLTO) && !D.isUsingLTO()) {
D.Diag(diag::err_drv_argument_only_allowed_with)		D.Diag(diag::err_drv_argument_only_allowed_with)
<< lastArgumentForMask(D, Args, Kinds & NeedsLTO) << "-flto";		<< lastArgumentForMask(D, Args, Kinds & NeedsLTO) << "-flto";
}		}

if ((Kinds & SanitizerKind::ShadowCallStack) && TC.getTriple().isAArch64() &&		if ((Kinds & SanitizerKind::ShadowCallStack) &&
!llvm::AArch64::isX18ReservedByDefault(TC.getTriple()) &&		((TC.getTriple().isAArch64() &&
		!llvm::AArch64::isX18ReservedByDefault(TC.getTriple())) \|\|
		TC.getTriple().isRISCV()) &&
!Args.hasArg(options::OPT_ffixed_x18)) {		!Args.hasArg(options::OPT_ffixed_x18)) {
D.Diag(diag::err_drv_argument_only_allowed_with)		D.Diag(diag::err_drv_argument_only_allowed_with)
<< lastArgumentForMask(D, Args, Kinds & SanitizerKind::ShadowCallStack)		<< lastArgumentForMask(D, Args, Kinds & SanitizerKind::ShadowCallStack)
<< "-ffixed-x18";		<< "-ffixed-x18";
}		}

// Report error if there are non-trapping sanitizers that require		// Report error if there are non-trapping sanitizers that require
// c++abi-specific parts of UBSan runtime, and they are not provided by the		// c++abi-specific parts of UBSan runtime, and they are not provided by the
▲ Show 20 Lines • Show All 696 Lines • Show Last 20 Lines

clang/lib/Driver/ToolChain.cpp

Show First 20 Lines • Show All 1,018 Lines • ▼ Show 20 Lines	SanitizerMask Res = (SanitizerKind::Undefined & ~SanitizerKind::Vptr &
SanitizerKind::UnsignedIntegerOverflow \|		SanitizerKind::UnsignedIntegerOverflow \|
SanitizerKind::ImplicitConversion \|		SanitizerKind::ImplicitConversion \|
SanitizerKind::Nullability \| SanitizerKind::LocalBounds;		SanitizerKind::Nullability \| SanitizerKind::LocalBounds;
if (getTriple().getArch() == llvm::Triple::x86 \|\|		if (getTriple().getArch() == llvm::Triple::x86 \|\|
getTriple().getArch() == llvm::Triple::x86_64 \|\|		getTriple().getArch() == llvm::Triple::x86_64 \|\|
getTriple().getArch() == llvm::Triple::arm \|\| getTriple().isWasm() \|\|		getTriple().getArch() == llvm::Triple::arm \|\| getTriple().isWasm() \|\|
getTriple().isAArch64())		getTriple().isAArch64())
Res \|= SanitizerKind::CFIICall;		Res \|= SanitizerKind::CFIICall;
if (getTriple().getArch() == llvm::Triple::x86_64 \|\| getTriple().isAArch64())		if (getTriple().getArch() == llvm::Triple::x86_64 \|\|
		getTriple().isAArch64() \|\| getTriple().isRISCV())
Res \|= SanitizerKind::ShadowCallStack;		Res \|= SanitizerKind::ShadowCallStack;
if (getTriple().isAArch64())		if (getTriple().isAArch64())
Res \|= SanitizerKind::MemTag;		Res \|= SanitizerKind::MemTag;
return Res;		return Res;
}		}

void ToolChain::AddCudaIncludeArgs(const ArgList &DriverArgs,		void ToolChain::AddCudaIncludeArgs(const ArgList &DriverArgs,
ArgStringList &CC1Args) const {}		ArgStringList &CC1Args) const {}
▲ Show 20 Lines • Show All 210 Lines • Show Last 20 Lines

clang/test/CodeGen/shadowcallstack-attr.c

	// RUN: %clang_cc1 -triple x86_64-linux-unknown -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=UNBLACKLISTED %s			// RUN: %clang_cc1 -triple x86_64-linux-unknown -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=UNBLOCKLISTED %s

	// RUN: %clang_cc1 -D ATTR -triple x86_64-linux-unknown -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLACKLISTED %s			// RUN: %clang_cc1 -D ATTR -triple x86_64-linux-unknown -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLOCKLISTED %s

	// RUN: echo -e "[shadow-call-stack]\nfun:foo" > %t			// RUN: echo -e "[shadow-call-stack]\nfun:foo" > %t
	// RUN: %clang_cc1 -fsanitize-blacklist=%t -triple x86_64-linux-unknown -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLACKLISTED %s			// RUN: %clang_cc1 -fsanitize-blacklist=%t -triple x86_64-linux-unknown -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLOCKLISTED %s

				// RUN: %clang_cc1 -triple riscv32-linux-gnu -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=UNBLOCKLISTED %s

				// RUN: %clang_cc1 -D ATTR -triple riscv32-linux-gnu -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLOCKLISTED %s

				// RUN: echo -e "[shadow-call-stack]\nfun:foo" > %t
				// RUN: %clang_cc1 -fsanitize-blacklist=%t -triple riscv32-linux-gnu -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLOCKLISTED %s

				// RUN: %clang_cc1 -triple riscv64-linux-gnu -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=UNBLOCKLISTED %s

				// RUN: %clang_cc1 -D ATTR -triple riscv64-linux-gnu -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLOCKLISTED %s

				// RUN: echo -e "[shadow-call-stack]\nfun:foo" > %t
				// RUN: %clang_cc1 -fsanitize-blacklist=%t -triple riscv64-linux-gnu -emit-llvm -o - %s -fsanitize=shadow-call-stack \| FileCheck -check-prefix=BLOCKLISTED %s

	#ifdef ATTR			#ifdef ATTR
				aaron.ballmanUnsubmitted Done Reply Inline Actions Now might be a good opportunity to update this check prefix to a less loaded term. aaron.ballman: Now might be a good opportunity to update this check prefix to a less loaded term.
	__attribute__((no_sanitize("shadow-call-stack")))			__attribute__((no_sanitize("shadow-call-stack")))
	#endif			#endif
	int foo(int a) { return a; }			int foo(int a) { return a; }

	// CHECK: define i32 @foo(i32* %a)			// CHECK: define i32 @foo(i32* %a)

	// BLACKLISTED-NOT: attributes {{.}}shadowcallstack{{.}}			// BLOCKLISTED-NOT: attributes {{.}}shadowcallstack{{.}}
	// UNBLACKLISTED: attributes {{.}}shadowcallstack{{.}}			// UNBLOCKLISTED: attributes {{.}}shadowcallstack{{.}}

clang/test/Driver/sanitizer-ld.c

	Show First 20 Lines • Show All 609 Lines • ▼ Show 20 Lines
	// CHECK-SHADOWCALLSTACK-LINUX-X86-64-NOT: error:			// CHECK-SHADOWCALLSTACK-LINUX-X86-64-NOT: error:

	// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \			// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \
	// RUN: -target aarch64-unknown-linux -fuse-ld=ld \			// RUN: -target aarch64-unknown-linux -fuse-ld=ld \
	// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64 %s			// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64 %s
	// CHECK-SHADOWCALLSTACK-LINUX-AARCH64: '-fsanitize=shadow-call-stack' only allowed with '-ffixed-x18'			// CHECK-SHADOWCALLSTACK-LINUX-AARCH64: '-fsanitize=shadow-call-stack' only allowed with '-ffixed-x18'

	// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \			// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \
				// RUN: -target riscv32-unknown-elf -fuse-ld=ld \
				// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-ELF-RISCV32 %s
				// CHECK-SHADOWCALLSTACK-ELF-RISCV32: '-fsanitize=shadow-call-stack' only allowed with '-ffixed-x18'

				// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \
				// RUN: -target riscv64-unknown-linux -fuse-ld=ld \
				// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-RISCV64 %s
				// CHECK-SHADOWCALLSTACK-LINUX-RISCV64: '-fsanitize=shadow-call-stack' only allowed with '-ffixed-x18'

				// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \
	// RUN: -target aarch64-unknown-linux -fuse-ld=ld -ffixed-x18 \			// RUN: -target aarch64-unknown-linux -fuse-ld=ld -ffixed-x18 \
	// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64-X18 %s			// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64-X18 %s
	// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \			// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \
	// RUN: -target arm64-unknown-ios -fuse-ld=ld \			// RUN: -target arm64-unknown-ios -fuse-ld=ld \
	// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64-X18 %s			// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64-X18 %s
	// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \			// RUN: %clang -fsanitize=shadow-call-stack %s -### -o %t.o 2>&1 \
	// RUN: -target aarch64-unknown-linux-android -fuse-ld=ld \			// RUN: -target aarch64-unknown-linux-android -fuse-ld=ld \
	// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64-X18 %s			// RUN: \| FileCheck --check-prefix=CHECK-SHADOWCALLSTACK-LINUX-AARCH64-X18 %s
	▲ Show 20 Lines • Show All 260 Lines • Show Last 20 Lines

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp

Show All 17 Lines

#include "llvm/CodeGen/MachineInstrBuilder.h" #include "llvm/CodeGen/MachineInstrBuilder.h"

#include "llvm/CodeGen/MachineRegisterInfo.h" #include "llvm/CodeGen/MachineRegisterInfo.h"

#include "llvm/CodeGen/RegisterScavenging.h" #include "llvm/CodeGen/RegisterScavenging.h"

#include "llvm/IR/DiagnosticInfo.h" #include "llvm/IR/DiagnosticInfo.h"

#include "llvm/MC/MCDwarf.h" #include "llvm/MC/MCDwarf.h"

using namespace llvm; using namespace llvm;

// For now we use x18, a.k.a s2, as pointer to shadow call stack.

// User should explicitly set -ffixed-x18 and not use x18 in their asm.

static void emitSCSPrologue(MachineFunction &MF, MachineBasicBlock &MBB,

MachineBasicBlock::iterator MI) {

if (!MF.getFunction().hasFnAttribute(Attribute::ShadowCallStack))

return;

std::vector<CalleeSavedInfo> &CSI = MF.getFrameInfo().getCalleeSavedInfo();

if (std::none_of(CSI.begin(), CSI.end(), [](CalleeSavedInfo &CSR) {

return CSR.getReg() == RISCV::X1;

}))

return;

jrtc27Unsubmitted

Not Done

if (find(CSI, RISCV::X1) == CSI.end())
  return;

(using llvm::find as a convenient wrapper around std::find, ie shorthand for std::find(CSI.begin(), CSI.end(), RISCV::X1)). Though personally I'd prefer to see X1 come from RI.getRARegister() rather than be hard-coded; other functions in this file already hard-code it, but in our CHERI fork we need to replace RISCV::X1 with RISCV::C1 everywhere so have changed those. Having said that, CHERI renders a shadow call stack unnecessary, so I don't particularly care if it's broken there, personally. But I still think it's nicer code.

jrtc27: ``` if (find(CSI, RISCV::X1) == CSI.end()) return; ``` (using `llvm::find` as a…

MaskRayUnsubmitted

Not Done

!llvm::is_contained(CSI, RISCV::X1)

MaskRay: `!llvm::is_contained(CSI, RISCV::X1)`

zzhengAuthorUnsubmitted

Done

Not sure how to make llvm::find or llvm::is_contained work in this scenario.

CSI is a std::vector<llvm::CalleeSavedInfo>. We need a getReg() for each element in it before comparing to a 'Register'

zzheng: Not sure how to make llvm::find or llvm::is_contained work in this scenario. CSI is a std…

jrtc27Unsubmitted

Done

- // Do not save RA to SCS if it's not saved to regular stack, i.e.

- // RA is not subject to overwritten.

+ // Do not save RA to the SCS if it's not saved to the regular stack,

+ // i.e. RA is not at risk of being overwritten.

std::vector<CalleeSavedInfo> &CSI = MF.getFrameInfo().getCalleeSavedInfo();

jrtc27:

jrtc27Unsubmitted

Done

// Do not save RA to the SCS if it's not saved to the regular stack,

- // i.e. RA is not at risk of being to overwritten.

+ // i.e. RA is not at risk of being overwritten.

std::vector<CalleeSavedInfo> &CSI = MF.getFrameInfo().getCalleeSavedInfo();

jrtc27:

const auto &STI = MF.getSubtarget<RISCVSubtarget>();

// Emit an error message and bail out.

if (!STI.isRegisterReservedByUser(RISCV::X18)) {

MF.getFunction().getContext().diagnose(DiagnosticInfoUnsupported{

MF.getFunction(), "x18 not reserved by user for Shadow Call Stack."});

jrtc27Unsubmitted

Done

Pointless comment; remove

jrtc27: Pointless comment; remove

return;

}

jrtc27Unsubmitted

Done

Pointless comment; remove

jrtc27: Pointless comment; remove

DebugLoc DL = MI != MBB.end() ? MI->getDebugLoc() : DebugLoc();

const RISCVInstrInfo *TII = STI.getInstrInfo();

int64_t SlotSize = STI.getXLen() / 8;

// Store return address to shadow call stack

// sw ra, 0(s2)

// addi s2, s2, 4

jrtc27Unsubmitted

Done

This should be passed in as an argument IMO (same for the epilogue) given the standard prologue/epilogue code already has a DebugLoc lying around.

jrtc27: This should be passed in as an argument IMO (same for the epilogue) given the standard…

BuildMI(MBB, MI, DL, TII->get(RISCV::SW))

jrtc27Unsubmitted

Done

This is wrong for RV64.

jrtc27: This is wrong for RV64.

.addReg(RISCV::X1)

.addReg(RISCV::X18)

.addImm(0);

BuildMI(MBB, MI, DL, TII->get(RISCV::ADDI))

.addReg(RISCV::X18, RegState::Define)

.addReg(RISCV::X18)

jrtc27Unsubmitted

Done

Is it intended that the shadow call stack grows *up* unlike the normal stack?

jrtc27: Is it intended that the shadow call stack grows *up* unlike the normal stack?

zzhengAuthorUnsubmitted

Done

No. Which direction the SCS grows on is trivial.

The memory area hosting SCS is independent of the regular stack; and it's provided by the runtime.
mmap/malloc returns the low address of newly mapped/allocated area. Making the SCS growing down requires the runtime to return upper bound of the SCS. On AArch64, the SCS grows up as well.

zzheng: No. Which direction the SCS grows on is trivial. The memory area hosting SCS is independent of…

jrtc27Unsubmitted

Done

Ok. Wasn't saying there was anything wrong with it, was just something that jumped out at me. Having it grow up makes more sense (the only real advantage to the normal stack growing down these days is that doing aligned allocations is slightly cheaper).

jrtc27: Ok. Wasn't saying there was anything wrong with it, was just something that jumped out at me.

jrtc27Unsubmitted

Done

// Store return address to shadow call stack

- // sw ra, 0(s2)

- // addi s2, s2, 4

+ // s[w|d] ra, 0(s2)

+ // addi s2, s2, [4|8]

BuildMI(MBB, MI, DL, TII->get(IsRV64 ? RISCV::SD : RISCV::SW))

jrtc27:

.addImm(SlotSize);

}

static void emitSCSEpilogue(MachineFunction &MF, MachineBasicBlock &MBB,

MachineBasicBlock::iterator MI) {

if (!MF.getFunction().hasFnAttribute(Attribute::ShadowCallStack))

return;

std::vector<CalleeSavedInfo> &CSI = MF.getFrameInfo().getCalleeSavedInfo();

if (std::none_of(CSI.begin(), CSI.end(), [](CalleeSavedInfo &CSR) {

return CSR.getReg() == RISCV::X1;

}))

return;

jrtc27Unsubmitted

Done

As above.

jrtc27: As above.

const auto &STI = MF.getSubtarget<RISCVSubtarget>();

// Emit an error message and bail out.

if (!STI.isRegisterReservedByUser(RISCV::X18)) {

MF.getFunction().getContext().diagnose(DiagnosticInfoUnsupported{

MF.getFunction(), "x18 not reserved by user for Shadow Call Stack."});

return;

jrtc27Unsubmitted

Done

- // Do not restore RA from SCS if it's not saved to regular stack, i.e.

- // RA is not subject to overwritten.

+ // See emitSCSPrologue

std::vector<CalleeSavedInfo> &CSI = MF.getFrameInfo().getCalleeSavedInfo();

No need to repeat ourselves.

jrtc27: No need to repeat ourselves.

jrtc27Unsubmitted

Done

This still holds.

jrtc27: This still holds.

}

DebugLoc DL = MI != MBB.end() ? MI->getDebugLoc() : DebugLoc();

const RISCVInstrInfo *TII = STI.getInstrInfo();

int64_t SlotSize = STI.getXLen() / 8;

jrtc27Unsubmitted

Done

Pointless comment; remove

jrtc27: Pointless comment; remove

// Load return address from shadow call stack

// addi s2, s2, -4

// lw ra, 0(s2)

jrtc27Unsubmitted

Done

Pointless comment; remove

jrtc27: Pointless comment; remove

BuildMI(MBB, MI, DL, TII->get(RISCV::ADDI))

.addReg(RISCV::X18, RegState::Define)

.addReg(RISCV::X18)

.addImm(-SlotSize);

BuildMI(MBB, MI, DL, TII->get(RISCV::LW))

jrtc27Unsubmitted

Done

Also wrong for RV64.

jrtc27: Also wrong for RV64.

.addReg(RISCV::X1, RegState::Define)

apazosUnsubmitted

Done

There are thee things to observe here and other reviewers might have some additional comments:

RISC-V does not have a reserved platform register like AAch64. The patch uses one of the RISC-V callee saved registers, x18, which happens to coincide with AArch64's register. It is possible to select another register, and additional checks for the flag combo "-fsanitize=shadow-call-stack -ffixed-xxxx" will have to be added.

The return address is saved on both the SCS (whose location is protected/hidden) and also in the regular stack. But the return from a function uses the value saved on SCS. The understanding is that not saving it in the regular stack can impact debugging.

The SCS is ascending, while the regular stack, by RISC-V convention, is descending. The SCS is not used for passing parameters between calls like the regular stack, so it seems to be ok. But this can be changed too. AArch64 's SCS is also ascending.

apazos: There are thee things to observe here and other reviewers might have some additional comments…

zzhengAuthorUnsubmitted

Done

Thanks for the clarification, Ana.

zzheng: Thanks for the clarification, Ana.

.addReg(RISCV::X18)

.addImm(0);

}

// Get the ID of the libcall used for spilling and restoring callee saved // Get the ID of the libcall used for spilling and restoring callee saved

// registers. The ID is representative of the number of registers saved or // registers. The ID is representative of the number of registers saved or

// restored by the libcall, except it is zero-indexed - ID 0 corresponds to a // restored by the libcall, except it is zero-indexed - ID 0 corresponds to a

// single register. // single register.

jrtc27Unsubmitted

Done

// Load return address from shadow call stack

- // addi s2, s2, -4

- // lw ra, 0(s2)

+ // addi s2, s2, -[4|8]

+ // l[w|d] ra, 0(s2)

BuildMI(MBB, MI, DL, TII->get(RISCV::ADDI))

Although in fact you have both a bug and a minor performance issue with this, and it should be:

// l[w|d] ra, [-4|-8](s2)
// addi   s2, s2, -[4|8]

Then there's no read-after-write dependency chain, which is better for out-of-order cores.

The bug is that, currently, you risk a signal handler clobbering your SCS slot in between the two instructions, since you deallocate the frame before you read from it. Will be rare in practice, but a possibility.

jrtc27: Although in fact you have both a bug and a minor performance issue with this, and it should be…

static int getLibCallID(const MachineFunction &MF, static int getLibCallID(const MachineFunction &MF,

const std::vector<CalleeSavedInfo> &CSI) { const std::vector<CalleeSavedInfo> &CSI) {

const auto *RVFI = MF.getInfo<RISCVMachineFunctionInfo>(); const auto *RVFI = MF.getInfo<RISCVMachineFunctionInfo>();

if (CSI.empty() || !RVFI->useSaveRestoreLibCalls(MF)) if (CSI.empty() || !RVFI->useSaveRestoreLibCalls(MF))

return -1; return -1;

▲ Show 20 Lines • Show All 179 Lines • ▼ Show 20 Lines void RISCVFrameLowering::emitPrologue(MachineFunction &MF,

const RISCVRegisterInfo *RI = STI.getRegisterInfo(); const RISCVRegisterInfo *RI = STI.getRegisterInfo();

const RISCVInstrInfo *TII = STI.getInstrInfo(); const RISCVInstrInfo *TII = STI.getInstrInfo();

MachineBasicBlock::iterator MBBI = MBB.begin(); MachineBasicBlock::iterator MBBI = MBB.begin();

// Emit prologue for shadow call stack.

emitSCSPrologue(MF, MBB, MBBI);

// Since spillCalleeSavedRegisters may have inserted a libcall, skip past // Since spillCalleeSavedRegisters may have inserted a libcall, skip past

// any instructions marked as FrameSetup // any instructions marked as FrameSetup

while (MBBI != MBB.end() && MBBI->getFlag(MachineInstr::FrameSetup)) while (MBBI != MBB.end() && MBBI->getFlag(MachineInstr::FrameSetup))

++MBBI; ++MBBI;

// Debug location must be unknown since the first debug location is used // Debug location must be unknown since the first debug location is used

// to determine the end of the prologue. // to determine the end of the prologue.

DebugLoc DL; DebugLoc DL;

▲ Show 20 Lines • Show All 219 Lines • ▼ Show 20 Lines adjustReg(MBB, LastFrameDestroy, DL, SPReg, SPReg, SecondSPAdjustAmount,

MachineInstr::FrameDestroy); MachineInstr::FrameDestroy);

} }

if (FirstSPAdjustAmount) if (FirstSPAdjustAmount)

StackSize = FirstSPAdjustAmount; StackSize = FirstSPAdjustAmount;

// Deallocate stack // Deallocate stack

adjustReg(MBB, MBBI, DL, SPReg, SPReg, StackSize, MachineInstr::FrameDestroy); adjustReg(MBB, MBBI, DL, SPReg, SPReg, StackSize, MachineInstr::FrameDestroy);

// Emit epilogue for shadow call stack.

emitSCSEpilogue(MF, MBB, MBBI);

} }

int RISCVFrameLowering::getFrameIndexReference(const MachineFunction &MF, int RISCVFrameLowering::getFrameIndexReference(const MachineFunction &MF,

int FI, int FI,

const MachineFrameInfo &MFI = MF.getFrameInfo(); const MachineFrameInfo &MFI = MF.getFrameInfo();

const TargetRegisterInfo *RI = MF.getSubtarget().getRegisterInfo(); const TargetRegisterInfo *RI = MF.getSubtarget().getRegisterInfo();

const auto *RVFI = MF.getInfo<RISCVMachineFunctionInfo>(); const auto *RVFI = MF.getInfo<RISCVMachineFunctionInfo>();

▲ Show 20 Lines • Show All 309 Lines • Show Last 20 Lines

llvm/test/CodeGen/RISCV/shadowcallstack.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -mtriple=riscv32-unknown-elf -mattr=+reserve-x18 -verify-machineinstrs < %s \
				jrtc27Unsubmitted Done Reply Inline Actions As I said before, please just use `-mtriple=riscv32`. The `-unknown-elf` is implied, irrelevant and wastes space, so all the OS-independent CodeGen tests just specify the CPU. jrtc27: As I said before, please just use `-mtriple=riscv32`. The `-unknown-elf` is implied, irrelevant…
				; RUN: \| FileCheck %s --check-prefix=RV32
				jrtc27Unsubmitted Done Reply Inline Actions Two extra spaces to indent the \| is the predominant style. jrtc27: Two extra spaces to indent the \| is the predominant style.

				jrtc27Unsubmitted Done Reply Inline Actions Please style these in the same way as the other RISC-V CodeGen tests, in terms of argument order, redirecting stdin rather than using `-o - %s`, and using riscv64 rather than riscv64-linux-gnu (unless needed?). Also use update_llc_test_checks.py rather than hand-writing this. And we generally use RV32I and RV64I (or other appropriate arch strings) instead of RISCV32 and RISCV64 prefixes. jrtc27: Please style these in the same way as the other RISC-V CodeGen tests, in terms of argument…
				jrtc27Unsubmitted Done Reply Inline Actions Delete this blank line. jrtc27: Delete this blank line.
				; RUN: llc -mtriple=riscv64-unknown-elf -mattr=+reserve-x18 -verify-machineinstrs < %s \
				; RUN: \| FileCheck %s --check-prefix=RV64

				define void @f1() shadowcallstack {
				; RV32-LABEL: f1:
				; RV32: # %bb.0:
				; RV32-NEXT: ret
				; RV32-NOT: x18
				pccUnsubmitted Done Reply Inline Actions Shouldn't it be looking for `s2` since that's how `x18` is spelled in assembly? pcc: Shouldn't it be looking for `s2` since that's how `x18` is spelled in assembly?
				jrtc27Unsubmitted Done Reply Inline Actions The -NOTs shouldn't even exist, this isn't how you use `update_llc_test_checks.py`. But yes, by default that's how it'll be printed unless you disable printing aliases. jrtc27: The -NOTs shouldn't even exist, this isn't how you use `update_llc_test_checks.py`. But yes, by…
				;
				; RV64-LABEL: f1:
				; RV64: # %bb.0:
				; RV64-NEXT: ret
				; RV64-NOT: x18
				ret void
				}

				declare void @foo()

				define void @f2() shadowcallstack {
				; RV32-LABEL: f2:
				; RV32: # %bb.0:
				; RV32-NEXT: tail foo
				; RV32-NOT: x18
				;
				; RV64-LABEL: f2:
				; RV64: # %bb.0:
				; RV64-NEXT: tail foo
				; RV64-NOT: x18
				tail call void @foo()
				ret void
				}

				declare i32 @bar()

				define i32 @f3() shadowcallstack {
				; RV32-LABEL: f3:
				; RV32: # %bb.0:
				; RV32-NEXT: sw ra, 0(s2)
				; RV32-NEXT: addi s2, s2, 4
				; RV32-NEXT: addi sp, sp, -16
				; RV32-NEXT: .cfi_def_cfa_offset 16
				; RV32-NEXT: sw ra, 12(sp)
				; RV32-NEXT: .cfi_offset ra, -4
				; RV32-NEXT: call bar
				; RV32-NEXT: lw ra, 12(sp)
				; RV32-NEXT: addi sp, sp, 16
				; RV32-NEXT: addi s2, s2, -4
				; RV32-NEXT: lw ra, 0(s2)
				; RV32-NEXT: ret
				;
				; RV64-LABEL: f3:
				; RV64: # %bb.0:
				; RV64-NEXT: sw ra, 0(s2)
				; RV64-NEXT: addi s2, s2, 8
				; RV64-NEXT: addi sp, sp, -16
				; RV64-NEXT: .cfi_def_cfa_offset 16
				; RV64-NEXT: sd ra, 8(sp)
				; RV64-NEXT: .cfi_offset ra, -8
				; RV64-NEXT: call bar
				; RV64-NEXT: ld ra, 8(sp)
				; RV64-NEXT: addi sp, sp, 16
				; RV64-NEXT: addi s2, s2, -8
				; RV64-NEXT: lw ra, 0(s2)
				; RV64-NEXT: ret
				%res = call i32 @bar()
				%res1 = add i32 %res, 1
				ret i32 %res
				}

				define i32 @f4() shadowcallstack {
				; RV32-LABEL: f4:
				; RV32: # %bb.0:
				; RV32-NEXT: sw ra, 0(s2)
				; RV32-NEXT: addi s2, s2, 4
				; RV32-NEXT: addi sp, sp, -16
				; RV32-NEXT: .cfi_def_cfa_offset 16
				; RV32-NEXT: sw ra, 12(sp)
				; RV32-NEXT: sw s0, 8(sp)
				; RV32-NEXT: sw s1, 4(sp)
				; RV32-NEXT: sw s3, 0(sp)
				; RV32-NEXT: .cfi_offset ra, -4
				; RV32-NEXT: .cfi_offset s0, -8
				; RV32-NEXT: .cfi_offset s1, -12
				; RV32-NEXT: .cfi_offset s3, -16
				; RV32-NEXT: call bar
				; RV32-NEXT: mv s3, a0
				; RV32-NEXT: call bar
				; RV32-NEXT: mv s1, a0
				; RV32-NEXT: call bar
				; RV32-NEXT: mv s0, a0
				; RV32-NEXT: call bar
				; RV32-NEXT: add a1, s3, s1
				; RV32-NEXT: add a0, s0, a0
				; RV32-NEXT: add a0, a1, a0
				; RV32-NEXT: lw s3, 0(sp)
				; RV32-NEXT: lw s1, 4(sp)
				; RV32-NEXT: lw s0, 8(sp)
				; RV32-NEXT: lw ra, 12(sp)
				; RV32-NEXT: addi sp, sp, 16
				; RV32-NEXT: addi s2, s2, -4
				; RV32-NEXT: lw ra, 0(s2)
				; RV32-NEXT: ret
				;
				; RV64-LABEL: f4:
				; RV64: # %bb.0:
				; RV64-NEXT: sw ra, 0(s2)
				; RV64-NEXT: addi s2, s2, 8
				; RV64-NEXT: addi sp, sp, -32
				; RV64-NEXT: .cfi_def_cfa_offset 32
				; RV64-NEXT: sd ra, 24(sp)
				; RV64-NEXT: sd s0, 16(sp)
				; RV64-NEXT: sd s1, 8(sp)
				; RV64-NEXT: sd s3, 0(sp)
				; RV64-NEXT: .cfi_offset ra, -8
				; RV64-NEXT: .cfi_offset s0, -16
				; RV64-NEXT: .cfi_offset s1, -24
				; RV64-NEXT: .cfi_offset s3, -32
				; RV64-NEXT: call bar
				; RV64-NEXT: mv s3, a0
				; RV64-NEXT: call bar
				; RV64-NEXT: mv s1, a0
				; RV64-NEXT: call bar
				; RV64-NEXT: mv s0, a0
				; RV64-NEXT: call bar
				; RV64-NEXT: add a1, s3, s1
				; RV64-NEXT: add a0, s0, a0
				; RV64-NEXT: addw a0, a1, a0
				; RV64-NEXT: ld s3, 0(sp)
				; RV64-NEXT: ld s1, 8(sp)
				; RV64-NEXT: ld s0, 16(sp)
				; RV64-NEXT: ld ra, 24(sp)
				; RV64-NEXT: addi sp, sp, 32
				; RV64-NEXT: addi s2, s2, -8
				; RV64-NEXT: lw ra, 0(s2)
				; RV64-NEXT: ret
				%res1 = call i32 @bar()
				%res2 = call i32 @bar()
				%res3 = call i32 @bar()
				%res4 = call i32 @bar()
				%res12 = add i32 %res1, %res2
				%res34 = add i32 %res3, %res4
				%res1234 = add i32 %res12, %res34
				ret i32 %res1234
				}

				define i32 @f5() shadowcallstack nounwind {
				; RV32-LABEL: f5:
				; RV32: # %bb.0:
				; RV32-NEXT: sw ra, 0(s2)
				; RV32-NEXT: addi s2, s2, 4
				; RV32-NEXT: addi sp, sp, -16
				; RV32-NEXT: sw ra, 12(sp)
				; RV32-NEXT: call bar
				; RV32-NEXT: lw ra, 12(sp)
				; RV32-NEXT: addi sp, sp, 16
				; RV32-NEXT: addi s2, s2, -4
				; RV32-NEXT: lw ra, 0(s2)
				; RV32-NEXT: ret
				;
				; RV64-LABEL: f5:
				; RV64: # %bb.0:
				; RV64-NEXT: sw ra, 0(s2)
				; RV64-NEXT: addi s2, s2, 8
				; RV64-NEXT: addi sp, sp, -16
				; RV64-NEXT: sd ra, 8(sp)
				; RV64-NEXT: call bar
				; RV64-NEXT: ld ra, 8(sp)
				; RV64-NEXT: addi sp, sp, 16
				; RV64-NEXT: addi s2, s2, -8
				; RV64-NEXT: lw ra, 0(s2)
				; RV64-NEXT: ret
				%res = call i32 @bar()
				%res1 = add i32 %res, 1
				ret i32 %res
				}

This is an archive of the discontinued LLVM Phabricator instance.

[RISCV] Support Shadow Call StackClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 280210

clang/lib/Driver/SanitizerArgs.cpp

clang/lib/Driver/ToolChain.cpp

clang/test/CodeGen/shadowcallstack-attr.c

clang/test/Driver/sanitizer-ld.c

llvm/lib/Target/RISCV/RISCVFrameLowering.cpp

llvm/test/CodeGen/RISCV/shadowcallstack.ll

[RISCV] Support Shadow Call Stack
ClosedPublic