This is an archive of the discontinued LLVM Phabricator instance.

[X86][MC][bugfix] Report error for mismatched modifier in inline asm and remove function getX86SubSuperRegisterOrZero
ClosedPublic

Authored by skan on Jan 28 2023, 11:46 PM.

Details

Summary
MCRegister getX86SubSuperRegister*(MCRegister Reg, unsigned Size,
                                  bool High = false);

A strange behavior of the functions getX86SubSuperRegister* was
introduced by llvm-svn:145579: The returned register may not
match the parameters when a 8-bit high register is required.

And llvm-svn: 175762 refined the code and dropped the comments, then we
knew nothing happened there from the code :-(

These two functions are only called with Size=8 and High=true in two places.
One is in X86FixupBWInsts.cpp for liveness of registers and the other is in
X86AsmPrinter.cpp for inline asm.

For the first one, we provide an alternative in this patch.
For the second one, the strange behaviour caused a bug that an erorr was not reported for mismatched modifier.

void f() {
  char x;
  asm volatile ("mov %%ah, %h0" :"=r"(x)::"%eax", "%ebx", "%ecx", "%edx", "edi", "esi");
}
$ gcc -S test.c

error: extended registers have no high halves 
$ clang -S test.c

no error

so we fix the bug in this patch.

getX86SubSuperRegister is just a wrapper of getX86SubSuperRegisterOrZero with a assert.
I belive we should remove the latter.

Diff Detail

Event Timeline

skan created this revision.Jan 28 2023, 11:46 PM
Herald added a project: Restricted Project. · View Herald TranscriptJan 28 2023, 11:46 PM
skan requested review of this revision.Jan 28 2023, 11:46 PM
Herald added a project: Restricted Project. · View Herald TranscriptJan 28 2023, 11:46 PM
skan retitled this revision from [RFC] Update and clarify the behavoir of functions getX86SubSuperRegister* to [RFC] Update and clarify the behavior of functions getX86SubSuperRegister*.Jan 28 2023, 11:50 PM
skan edited the summary of this revision. (Show Details)
skan updated this revision to Diff 493073.Jan 29 2023, 12:04 AM

Fix typo in comments

skan retitled this revision from [RFC] Update and clarify the behavior of functions getX86SubSuperRegister* to [RFC][X86][MC] Update and clarify the behavior of functions getX86SubSuperRegister*.Jan 29 2023, 12:06 AM

The change doesn't seem correct. It seems the code is used for special case, see https://github.com/llvm/llvm-project/commit/e93249befaa2383cc34c27771265123090cda085

skan added a comment.Jan 29 2023, 5:11 AM

@pengfei I noticed and mentioned this commit in the summary. I think the special purpose is only for AH, DH, CH and DH b/c they are the only 8-bit high registers that are not artifical. For others, X86 backend does not have subregister liveness tracking enabled, so either 16 or 64 can work.
There is no reason to make a difference for SI, DI, BP and SP.

skan added a comment.Jan 29 2023, 5:16 AM

@pengfei I noticed and mentioned this commit in the summary. I think the special purpose is only for AH, DH, CH and DH b/c they are the only 8-bit high registers that are not artifical. For others, X86 backend does not have subregister liveness tracking enabled, so either 16 or 64 can work.
There is no reason to make a difference for SI, DI, BP and SP.

I checked that there is no LIT test fail no matter we use getX86SubSuperRegisterOrZero(Reg, 16) or getX86SubSuperRegisterOrZero(Reg, 64) here.

skan updated this revision to Diff 493087.Jan 29 2023, 5:41 AM

Not use report_fatal_error b/c it's unrelated to user input

skan edited the summary of this revision. (Show Details)Jan 29 2023, 5:42 AM

Not use report_fatal_error b/c it's unrelated to user input

There will be difference with dedicated cases, e.g.: https://godbolt.org/z/oca6bj4s3

skan added a comment.Jan 29 2023, 7:31 AM

Not use report_fatal_error b/c it's unrelated to user input

There will be difference with dedicated cases, e.g.: https://godbolt.org/z/oca6bj4s3

It is an impossible case. We can only access 8-bit high register for register a, b, c d in inline asm.

void f() {
  char x;
  asm volatile ("mov %h0, %h0" :"=r"(x)::);
  asm volatile ("mov %h0, %h0" :"=r"(x)::"%eax", "%ebx", "%ecx", "%edx", "edi", "esi");
}
bash$ gcc -S 1.c -o -
        .file   "1.c"
        .text
        .globl  f
        .type   f, @function
f:
.LFB0:
        .cfi_startproc
        pushq   %rbp
        .cfi_def_cfa_offset 16
        .cfi_offset 6, -16
        movq    %rsp, %rbp
        .cfi_def_cfa_register 6
        pushq   %rbx
        .cfi_offset 3, -24
#APP
# 3 "1.c" 1
        mov %ah, %ah
# 0 "" 2
#NO_APP
        movb    %al, -9(%rbp)
#APP
# 4 "1.c" 1
1.c: In function ‘f’:
1.c:5:1: error: extended registers have no high halves

Not use report_fatal_error b/c it's unrelated to user input

There will be difference with dedicated cases, e.g.: https://godbolt.org/z/oca6bj4s3

It is an impossible case. We can only access 8-bit high register for register a, b, c d in inline asm.

void f() {
  char x;
  asm volatile ("mov %h0, %h0" :"=r"(x)::);
  asm volatile ("mov %h0, %h0" :"=r"(x)::"%eax", "%ebx", "%ecx", "%edx", "edi", "esi");
}
bash$ gcc -S 1.c -o -
        .file   "1.c"
        .text
        .globl  f
        .type   f, @function
f:
.LFB0:
        .cfi_startproc
        pushq   %rbp
        .cfi_def_cfa_offset 16
        .cfi_offset 6, -16
        movq    %rsp, %rbp
        .cfi_def_cfa_register 6
        pushq   %rbx
        .cfi_offset 3, -24
#APP
# 3 "1.c" 1
        mov %ah, %ah
# 0 "" 2
#NO_APP
        movb    %al, -9(%rbp)
#APP
# 4 "1.c" 1
1.c: In function ‘f’:
1.c:5:1: error: extended registers have no high halves

I'm not sure if there's special requirment in LLVM. If we want to match with gcc, should we just need to replace return getX86SubSuperRegisterOrZero(Reg, 64); with return X86::NoRegister?

skan added a comment.Jan 29 2023, 5:54 PM

Not use report_fatal_error b/c it's unrelated to user input

There will be difference with dedicated cases, e.g.: https://godbolt.org/z/oca6bj4s3

It is an impossible case. We can only access 8-bit high register for register a, b, c d in inline asm.

void f() {
  char x;
  asm volatile ("mov %h0, %h0" :"=r"(x)::);
  asm volatile ("mov %h0, %h0" :"=r"(x)::"%eax", "%ebx", "%ecx", "%edx", "edi", "esi");
}
bash$ gcc -S 1.c -o -
        .file   "1.c"
        .text
        .globl  f
        .type   f, @function
f:
.LFB0:
        .cfi_startproc
        pushq   %rbp
        .cfi_def_cfa_offset 16
        .cfi_offset 6, -16
        movq    %rsp, %rbp
        .cfi_def_cfa_register 6
        pushq   %rbx
        .cfi_offset 3, -24
#APP
# 3 "1.c" 1
        mov %ah, %ah
# 0 "" 2
#NO_APP
        movb    %al, -9(%rbp)
#APP
# 4 "1.c" 1
1.c: In function ‘f’:
1.c:5:1: error: extended registers have no high halves

I'm not sure if there's special requirment in LLVM. If we want to match with gcc, should we just need to replace return getX86SubSuperRegisterOrZero(Reg, 64); with return X86::NoRegister?

I think so. And it looks like one of getX86SubSuperRegisterOrZero and getX86SubSuperRegister is redundant, let me do some clean work.

skan updated this revision to Diff 493197.Jan 29 2023, 9:13 PM

Remove function getX86SubSuperRegisterOrZero

skan retitled this revision from [RFC][X86][MC] Update and clarify the behavior of functions getX86SubSuperRegister* to [X86][MC][bugfix] Report error for mismatched modifier in inline asm and remove function getX86SubSuperRegisterOrZero.Jan 29 2023, 9:35 PM
skan edited the summary of this revision. (Show Details)
skan edited the summary of this revision. (Show Details)Jan 29 2023, 9:37 PM
craig.topper added inline comments.Jan 29 2023, 9:50 PM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
745–746

Didn't the old name more accurately describe this function?

llvm/lib/Target/X86/X86AsmPrinter.cpp
545

Do we need to assert that Reg is valid? We used to have one.

549

Reg.isValid()

llvm/lib/Target/X86/X86FixupBWInsts.cpp
219

HighReg.isValid()

llvm/lib/Target/X86/X86ISelLowering.cpp
57580

DestReg.isValid()

skan added inline comments.Jan 29 2023, 10:19 PM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
745–746

I don't think so

  1. X86::NoRegister is just a invalid value and it doesn't have to be zero
  2. When we talk about a API like getXXX, we can expect it to return an invalid value, e.g we use getCondCode rather than getCondCodeOrInvalidCond
llvm/lib/Target/X86/X86AsmPrinter.cpp
545

https://llvm.org/docs/CodingStandards.html#assert-liberally

assert is disabled for release build. I belive it is only used to report internal error for LLVM developers and it should be used only when the developer has concern.
We have checked the register class of Reg at line 536-539, so Reg is always valid here obviously. I think assertion is no need here.

skan updated this revision to Diff 493205.Jan 29 2023, 10:30 PM
skan marked 2 inline comments as done.

Address review comments

craig.topper added inline comments.Jan 29 2023, 10:37 PM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
748

Why do we allow invalid sizes?

873

This should be NoRegister

llvm/lib/Target/X86/X86AsmPrinter.cpp
545

Did we lose the assertion in other places too? Unfortunately due to the rename, I can't see all callers of the old getX86SubSuperRegister in the review.

skan added inline comments.Jan 29 2023, 10:54 PM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
748

I think the previous author was overly conservative here. Let's use a llvm_unreachable here.

llvm/lib/Target/X86/X86AsmPrinter.cpp
545

Yes, we lose it in other places too. There is only a few places that need the assertion, let me add it.

skan updated this revision to Diff 493211.Jan 29 2023, 11:48 PM
skan marked 4 inline comments as done.

Address reivew comments

skan added inline comments.Jan 29 2023, 11:56 PM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
748

Why do we allow invalid sizes?

The test CodeGen/X86/asm-reject-reg-type-mismatch.ll will fail if we use llvm_unreachable here.

; CHECK: error: couldn't allocate output register for constraint '{ax}'
define i128 @blup() {
  %v = tail call i128 asm "", "={ax},0"(i128 0)
  ret i128 %v
}

See more details here https://reviews.llvm.org/D10813

pengfei added inline comments.Jan 30 2023, 12:57 AM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
748

Is it caused by error message not match?

skan added inline comments.Jan 30 2023, 7:07 AM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
748

Is it caused by error message not match?

It is caused by the crash due to the llvm_unreachable at line 748.

UNREACHABLE executed at llvm-project/llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp:748!

getX86SubSuperRegister is called with Size 128, and an inline asm error is emitted in SelectionDAGBuilder.cpp when the returned register is not valid.
If we used "llvm_unreachable" at line 748, the llc would crash before emitting an error.

pengfei added inline comments.Jan 30 2023, 7:14 AM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
748

Will it be solved by adding --crash after not?

skan added inline comments.Jan 30 2023, 7:45 AM
llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp
748

Yes, we also need to move other tests in asm-reject-reg-type-mismatch.ll into another file

; CHECK: error: couldn't allocate input reg for constraint 'r'
define void @fp80(x86_fp80) {
  tail call void asm sideeffect "", "r"(x86_fp80 %0)
  ret void
}

; CHECK: error: couldn't allocate input reg for constraint 'f'
define void @f_constraint_i128(ptr %0) {
  %2 = load i128, ptr %0, align 16
  tail call void asm sideeffect "", "f"(i128 %2)
  ret void
}

b/c when llc crashes, the remaining file will not be processed. But, should we do such thing?

I think we can just remove the function blup and the check in this test b/c it is impossible case. Does it sound good to you?

skan updated this revision to Diff 493478.Jan 30 2023, 7:50 PM

Use llvm_unreachable for illeagl register size

craig.topper added inline comments.Jan 30 2023, 7:58 PM
llvm/test/CodeGen/X86/asm-reject-reg-type-mismatch.ll
4 ↗(On Diff #493478)

Is it not possible to do this with __int128 in C?

craig.topper added inline comments.Jan 30 2023, 8:01 PM
llvm/test/CodeGen/X86/asm-reject-reg-type-mismatch.ll
4 ↗(On Diff #493478)
void foo() {
    __int128 a;
    asm volatile ("" : "+a"(a));
}
skan added inline comments.Jan 30 2023, 8:52 PM
llvm/test/CodeGen/X86/asm-reject-reg-type-mismatch.ll
4 ↗(On Diff #493478)
void foo() {
    __int128 a;
    asm volatile ("" : "+a"(a));
}

Good example!

skan updated this revision to Diff 493483.Jan 30 2023, 9:06 PM
skan marked 2 inline comments as done.

Address review comments: add a test back

skan added a comment.Jan 30 2023, 9:11 PM

@craig.topper I keep the llvm_unreachable by using a pre-check in X86ISelLowering.cpp. It's the only place I found the getX86SubSuperRegister is called with illeagl size.

pengfei accepted this revision.Feb 1 2023, 4:23 AM

LGTM.

This revision is now accepted and ready to land.Feb 1 2023, 4:23 AM
This revision was landed with ongoing or failed builds.Feb 1 2023, 6:09 PM
This revision was automatically updated to reflect the committed changes.