Refer the MS ARM64 ABI Convention for the behavior for struct returns:
https://docs.microsoft.com/en-us/cpp/build/arm64-windows-abi-conventions#return-values
Details
Diff Detail
Event Timeline
lib/CodeGen/CGCall.cpp | ||
---|---|---|
1989 | We shouldn't be checking the triple here. Can we extend CGFunctionInfo to record this instead, then perform the check in AArch64ABIInfo in lib/CodeGen/TargetInfo.cpp? | |
1991 | Are you sure isPOD() is the right check? I wouldn't trust the spec; for other Microsoft targets in other similar situations, we usually check whether the class has a trivial copy constructor, or something like that; see MicrosoftCXXABI::getRecordArgABI. |
lib/CodeGen/CGCall.cpp | ||
---|---|---|
1989 | One thing we can do is to add a new ABIKind in AArch64ABIInfo, and test it as the condition | |
1991 | Do you have any information about why isPOD is not the right check? The POD definition is defined as the spec. I am seeing clang has many places to setup PlainOldData = false. It doesn't look like it is just about testing constructor for me. |
lib/CodeGen/CGCall.cpp | ||
---|---|---|
1991 | Just verified, you're right, POD is actually the right check. It's just a bit weird because there isn't any technical reason for the calling convention code to check whether the class has, for example, a trivial default constructor. |
I think you may want to use a different IR convention to indicate whether X0 or X8 should be used for the pointer to the returned struct. My first thought is to use inreg to indicate that the struct is POD and X8 should be used, and that it does not need to be preserved. In AArch64ABIInfo, you would check for POD-ness and use ABIArgInfo::getIndirectInReg instead of ABIArgInfo::getIndirect.
We should also think about the best way to represent the fact that the address of a non-POD return value must be returned in X0. For X86, the sret attribute is used to communicate this requirement. Another alternative would be to change the LLVM IR function return type to a pointer type and add the LLVM IR returned attribute to the argument with the address of the return value. If we decide not to go with sret + inreg, we need to do this anyway, since the backend has no way of knowing it needs to preserve X0.
lib/CodeGen/CGCall.cpp | ||
---|---|---|
1989 | ABIKind is an implementation detail of AArch64ABIInfo. I'm not sure how we would use that here. |
lib/CodeGen/TargetInfo.cpp | ||
---|---|---|
4926 | Do we should limit this to aarch64 windows only? I feel it will impact x86 64bit . |
lib/CodeGen/TargetInfo.cpp | ||
---|---|---|
4926 | I just realized Win64 is under AArch64ABIInfo. So this change should be good. |
include/clang/CodeGen/CGFunctionInfo.h | ||
---|---|---|
550 | Use a bit field like ReturnsRetained above? |
My first thought is to use inreg to indicate that the struct is POD and X8 should be used, and that it does not need to be preserved.
If we were going to do this, we'd probably want to reverse the meaning of inreg; I would rather not break compatibility with existing bitcode, particularly on non-Windows targets.
Another alternative would be to change the LLVM IR function return type to a pointer type and add the LLVM IR returned attribute to the argument with the address of the return value.
We do something similar to this for constructors on some targets; see CGCXXABI::HasThisReturn. Probably not hard to extend if we wanted to.
I see, I hadn't realized X8 was the normal sret register. Yes, sret + inreg would mean "in the other sret register". However, now I think we should toss this idea and use returned instead, since SysV AArch64 doesn't return the incoming sret pointer in X0, so we'd need to make this new sret+inreg thing imply that.
Another alternative would be to change the LLVM IR function return type to a pointer type and add the LLVM IR returned attribute to the argument with the address of the return value.
We do something similar to this for constructors on some targets; see CGCXXABI::HasThisReturn. Probably not hard to extend if we wanted to.
Yep.
Reid, are you okay with merging something like the current patch for now, and implementing returning the value as a followup?
lib/CodeGen/CGCall.cpp | ||
---|---|---|
1990 | It would be better to move the isPOD check itself into target/ABI-specific code, more like setSRetAfterThis. But not important either way, I guess. |
Yeah.
Please add some test cases that verify that the right thing happens for instance methods returning POD structs. Should the sret parameter be in X1 or X8? Looks like you want X1, so isPOD isn't the right check.
struct IsPOD { int x; }; struct Foo { IsPOD foo() { return IsPOD{3}; } }; int main() { return Foo().foo().x; }
include/clang/CodeGen/CGFunctionInfo.h | ||
---|---|---|
550 | Yes, please do not grow this struct needlessly. Please name it something more generic like SuppressSRet. | |
lib/CodeGen/TargetInfo.cpp | ||
4926 | Based on my test case, you should call setSuppressSRet(true) whenever ::classifyReturnType returns true in the MS C++ ABI. |
include/clang/CodeGen/CGFunctionInfo.h | ||
---|---|---|
521–523 | This should be a member of the ABIArgInfo, not CGFunctionInfo. |
The ABI doc just distinguishes between POD and non-POD types.
The address of the memory block shall be passed as an additional argument to the function in x8 for POD type, or in x0 (or x1 if $this is passed in x0) for non-POD type.
So even for instance methods returning PODs, X8 should be used.
Experimentation shows otherwise: https://godbolt.org/g/jQVW78 X1 is used to pass the sret parameter. Trust but verify. :)
Thanks Reid. Yes, you are correct. Instance methods seem to use X1 for struct returns. I have updated my patch accordingly. I guess the doc needs fixing.
@TomTan @haripul As Reid pointed out, MS ARM64 seems to use X1 for returning POD struct address. See https://godbolt.org/g/jQVW78.
However, the doc at https://docs.microsoft.com/en-us/cpp/build/arm64-windows-abi-conventions#return-values only mentions the following:
The address of the memory block shall be passed as an additional argument to the function in x8 for POD type, or in x0 (or x1 if $this is passed in x0) for non-POD type.
Could you please check if the doc needs to be fixed?
Maybe enforce isIndirect(), like we do for other similar flags?