This is an archive of the discontinued LLVM Phabricator instance.

[clang][Diagnostics] Show line numbers when printing code snippets
ClosedPublic

Authored by tbaeder on Apr 9 2023, 1:28 AM.

Details

Summary

Just putting this here because it's a pretty simple patch as it turns out.

Done:

  1. Increase -fcaret-diagnostics-max-lines default to 16.
  2. Add -fdiagnostics-show-line-numbers to disable the margin with the line numbers on the left
  3. Actually show the line numbers in all diagnostics.
  4. Fix up tests.

Sample output:

./array.cpp:98:1: error: static assertion failed due to requirement 'func()'
  98 | static_assert(
     | ^
  99 |     // I wonder if this works.
 100 |     // Also, what if it doesn't?
 101 |     func());
     |     ~~~~~~
1 error generated.

Diff Detail

Event Timeline

tbaeder created this revision.Apr 9 2023, 1:28 AM
Herald added a project: Restricted Project. · View Herald TranscriptApr 9 2023, 1:28 AM
tbaeder requested review of this revision.Apr 9 2023, 1:28 AM
Herald added a project: Restricted Project. · View Herald TranscriptApr 9 2023, 1:28 AM
Herald added a subscriber: cfe-commits. · View Herald Transcript
tbaeder added inline comments.Apr 9 2023, 1:28 AM
clang/lib/Frontend/TextDiagnostic.cpp
1139

Wrote this just to get things going but I hope there's something better than this...?

tbaeder updated this revision to Diff 512024.Apr 9 2023, 10:14 AM
tbaeder edited the summary of this revision. (Show Details)

GCC emits line numbers by default these days, so it seems reasonable for Clang to do so as well. Having a way to disable the feature is important though as some folks have scripts that scrap diagnostic results and changing the format may break those scripts in challenging ways. GCC supports -fno-diagnostics-show-line-numbers and we should do the same. GCC also supports -fdiagnostics-minimum-margin-width=width which we might want to consider supporting up front as well.

As for -fcaret-diagnostics-max-lines, I think I agree that the current default of 1 makes this functionality a bit less than useful, but perhaps we want to change that default value now? Alternatively, do we want to tie this option to the number of max lines? e.g., if the max lines is one, then we disable line numbers by default, but if the max lines is > 1, we enable line numbers by default?

cjdb added a comment.Apr 10 2023, 10:05 AM

I have no opinion on this addition. If folks find it useful in GCC, happy to see it added.

clang/lib/Frontend/TextDiagnostic.cpp
1137

I'd prefer this to be an assert rather than returning a value. We should never be able to return from here.

tbaeder marked an inline comment as done.Apr 11 2023, 5:24 AM

Wow, in FixIt/fixits-function-call, this output:

./array.cpp:123:3: error: no matching function for call to 'f1'
  f1(a + 1);
  ^~
./array.cpp:114:6: note: candidate function not viable: no known conversion from 'intTy2 *' (aka 'int *') to 'intTy &' (aka 'int &') for 1st argument; dereference the argument with *
void f1(intTy &a);
     ^
fix-it:"./array.cpp":{123:6-123:6}:"*("
fix-it:"./array.cpp":{123:11-123:11}:")"

now (with increased snippet line limit) prints:

./array.cpp:123:3: error: no matching function for call to 'f1'
  123 |   f1(a + 1);
      |   ^~
./array.cpp:114:6: note: candidate function not viable: no known conversion from 'intTy2 *' (aka 'int *') to 'intTy &' (aka 'int &') for 1st argument; dereference the argument with *
  114 | void f1(intTy &a);
      |      ^
  116 | void f2(intTy2 *a) {
  117 | // CHECK: error: no matching function for call to 'f1
  118 | // CHECK: dereference the argument with *
  119 | // CHECK: void f1(intTy &a);
  120 | // CHECK: fix-it{{.*}}*(
  121 | // CHECK-NEXT: fix-it{{.*}})
  122 | // CHECK: void f1(double *a);
  123 |   f1(a + 1);
      |      ~~~~~
      |      *(   )
fix-it:"./array.cpp":{123:6-123:6}:"*("
fix-it:"./array.cpp":{123:11-123:11}:")"

Is this actually what we want? It somehow makes sense but also doesn't. It's weird that the test ensures the parseable fixits but the output doesn't show the fixit at all (in the first case).

tbaeder updated this revision to Diff 512784.Apr 12 2023, 5:05 AM
tbaeder set the repository for this revision to rG LLVM Github Monorepo.
tbaeder updated this revision to Diff 512785.Apr 12 2023, 5:07 AM
tbaeder updated this revision to Diff 512795.Apr 12 2023, 5:38 AM

Since basically nobody has seen multiple snippet lines being printed, we probably need to fix up a bunch of diagnostics afterwards to be more useful, e.g.

./array.cpp:111:5: error: no matching function for call to 'func'
  111 |     func(1, 2, 3, 4), "hah, reason!");
      |     ^~~~
./array.cpp:96:16: note: candidate function not viable: requires 7 arguments, but 4 were provided
   96 | consteval bool func(int why,
      |                ^    ~~~~~~~~
   97 |                     int would_my,
      |                     ~~~~~~~~~~~~~
   98 |                     int compiler_complain,
      |                     ~~~~~~~~~~~~~~~~~~~~~~
   99 |                     int about_the_arguments,
      |                     ~~~~~~~~~~~~~~~~~~~~~~~~
  100 |                     int but_not_show_them_to_me,
      |                     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  101 |                     int if_the_declaration_covers_multiple_lines,
      |                     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  102 |                     int big_question_mark) {
      |                     ~~~~~~~~~~~~~~~~~~~~~
1 error generated.

has annoyed me in the past (using one argument per line is common in some coding styles).

tbaeder updated this revision to Diff 513144.Apr 13 2023, 3:12 AM
tbaeder updated this revision to Diff 514954.Apr 19 2023, 7:46 AM
tbaeder updated this revision to Diff 515242.Apr 20 2023, 12:55 AM

Fix all existing tests

CCing @njames93 since I have no idea how to properly fix the clang-tidy test failures.

tbaeder retitled this revision from [clang][Diagnostics] WIP: Show line numbers when printing code snippets to [clang][Diagnostics] Show line numbers when printing code snippets.Apr 26 2023, 1:04 AM
tbaeder set the repository for this revision to rG LLVM Github Monorepo.

CCing @njames93 since I have no idea how to properly fix the clang-tidy test failures.

I look into some of those tests, there are 1 way to fix them. Tried to -fno-diagnostics-show-line-numbers but clang-tidy got history of ignoring those diagnostic flags, I thing there are some issues open for it.

In those tests you got thing like this:

// CHECK-MESSAGES-NEXT: ^~~
// CHECK-MESSAGES-NEXT: {{^ *}}LU{{$}}

this could be changed into something like this:

// CHECK-MESSAGES-NEXT: ^~~
// CHECK-MESSAGES-NEXT: {{^ *\| *}}LU{{$}}

Simply this matching beginning of a line. by using ^ *\| * we match: beginning of line, any number of white spaces, character | any number of white spaces.

tbaeder updated this revision to Diff 517450.Apr 26 2023, 10:53 PM
Herald added a project: Restricted Project. · View Herald TranscriptApr 26 2023, 10:53 PM
tbaeder updated this revision to Diff 517463.Apr 27 2023, 12:07 AM
PiotrZSL added inline comments.Apr 27 2023, 12:09 AM
clang-tools-extra/test/clang-tidy/checkers/cert/uppercase-literal-suffix-integer.cpp
34 ↗(On Diff #517450)

In theory you should escape this character (|) by using \, otherwise it could be interpreted as an regexp or. Still looks like this test file passes, so 7 to go.

tbaeder added inline comments.Apr 27 2023, 12:10 AM
clang-tools-extra/test/clang-tidy/checkers/cert/uppercase-literal-suffix-integer.cpp
34 ↗(On Diff #517450)

Yeah I noticed the missing backslash but the test passed.

For the other tests, I've just removed the {{^ *}} before the uppercase suffix. It seems pointless to check that the line contains nothing else(?). I can revert that of course, but it seemed like the easier fix.

tbaeder edited the summary of this revision. (Show Details)Apr 27 2023, 12:43 AM

Clang-tidy related changes looks fine.

aaron.ballman added a subscriber: Restricted Project.May 4 2023, 1:48 PM

This should definitely have explicit documentation added to https://github.com/llvm/llvm-project/blob/main/clang/docs/UsersManual.rst?plain=1#L156 and a release note.

Adding clang-vendors because of the potential that this is a breaking change for some folks doing diagnostic scraping; perhaps we should also mention this under potentially breaking changes in the release notes?

clang/include/clang/Driver/Options.td
2547

I'm hopeful that defaulting this to true (which matches the GCC behavior) is reasonable, but I'm a bit worried about how downstreams will react to this. Specifically, I'm wondering about things like clang-cl integration in MSVC or clang use in Xcode, where diagnostics are being displayed other than on the command line.

clang/test/FixIt/fixit-function-call.cpp
1 ↗(On Diff #517463)

Just to double-check, parseable diagnostic output still works even if line numbers are shown, yes?

kwk added a subscriber: kwk.May 4 2023, 2:17 PM

@tbaeder I have a small suggestion.

clang/lib/Frontend/TextDiagnostic.cpp
1122–1139

This function screamed at me to be generalized so I gave it a try: https://gist.github.com/kwk/7e408065ea291e49fea4a83cf90a9cdf

jrtc27 added a subscriber: jrtc27.May 4 2023, 2:46 PM
jrtc27 added inline comments.
clang/lib/Frontend/TextDiagnostic.cpp
1122–1139

I don't think I want floating point arithmetic in my compiler... Something like:

unsigned L, M;
for (L = 1U, M = 10U; N >= M && M != ~0U; ++L)
    M = (M > ((~0U) / 10U)) ? (~0U) : (M * 10U);

return (L);

is the generalised form (without all the redundant parentheses I added during debugging).

tbaeder added inline comments.May 5 2023, 3:53 AM
clang/test/FixIt/fixit-function-call.cpp
1 ↗(On Diff #517463)

Yes, at leat we don't prepend the line numbers to the parseable fixits:

   21 | // CHECK-NEXT: fix-it{{.*}})
   22 | // CHECK: void f1(double *a);
   23 |   f1(a + 1);
      |      ~~~~~
      |      *(   )
fix-it:"../clang/test/FixIt/fixit-function-call.cpp":{23:6-23:6}:"*("
fix-it:"../clang/test/FixIt/fixit-function-call.cpp":{23:11-23:11}:")"
tbaeder updated this revision to Diff 519799.May 5 2023, 4:11 AM

Add docs and a release note.

aaron.ballman added inline comments.May 5 2023, 9:08 AM
clang/lib/Frontend/TextDiagnostic.cpp
1122–1139

Cleaned up a bit:

constexpr unsigned getNumDisplayWidth(unsigned N) {
  unsigned L = 1U, M = 10U;
  constexpr unsigned Upper = ~0U / 10U;
  for (; N >= M && M != ~0U; ++L)
    M = M > Upper ? ~0U : M * 10U;
  return L;
}

https://godbolt.org/z/szTYsEM4v

clang/test/FixIt/fixit-function-call.cpp
1 ↗(On Diff #517463)

Excellent, thank you!

efriedma added inline comments.
clang/lib/Frontend/TextDiagnostic.cpp
1122–1139

Slightly less efficient, but easier to read:

unsigned getNumDisplayWidth(unsigned N) {
  unsigned Width = 1;
  for (; N >= 10; ++Width)
    N /= 10;
  return Width;
}
aaronpuchert added inline comments.
clang/include/clang/Driver/Options.td
2547

I don't like line number margins that much, but I think it should be Ok. What I've seen of compiler output parsers, they mainly concentrate on the <file>:<line>[:<column>]: <kind>: <message> lines and ignore code snippets. After all, once they have the source location, they can just look at the file.

clang/lib/Frontend/TextDiagnostic.cpp
1122–1139

I'd suggest to replace ~0U by std::numeric_limits<unsigned>::max(). And if we're looking at <limits>, why not ask it directly how far we need to go?

static unsigned getNumDisplayWidth(unsigned N) {
  unsigned L = 1u, M = 10u;
  constexpr unsigned Max = std::numeric_limits<unsigned>::digits10 + 1;
  for (; M <= N && L != Max; ++L)
    M *= 10u;
  return L;
}

This will let the overflow happen, but that should be fine for unsigned arithmetic. Without overflow:

static unsigned getNumDisplayWidth(unsigned N) {
  unsigned L = 1u, M = 10u;
  while (M <= N && L++ != std::numeric_limits<unsigned>::digits10)
    M *= 10u;
  return L;
}

The trick is that the increment is done even if the comparison fails and we exit the loop.

aaronpuchert added inline comments.May 5 2023, 5:25 PM
clang/lib/Frontend/TextDiagnostic.cpp
1122–1139

Though pre-increment also does it if we change the right-hand side:

while (M <= N && ++L != std::numeric_limits<unsigned>::digits10 + 1)
  M *= 10u;
aaron.ballman added inline comments.May 15 2023, 11:11 AM
clang/lib/Frontend/TextDiagnostic.cpp
1122–1139

+1 to the final suggestion that's fully generalized.

tbaeder updated this revision to Diff 522580.May 16 2023, 6:24 AM
tbaeder marked 7 inline comments as done.
This revision is now accepted and ready to land.May 16 2023, 6:31 AM
This revision was landed with ongoing or failed builds.May 16 2023, 8:13 AM
This revision was automatically updated to reflect the committed changes.
hans added a subscriber: hans.Jun 2 2023, 6:34 AM
hans added inline comments.
clang/docs/UsersManual.rst
578 ↗(On Diff #522634)

I think this is still a cc1-only option. Should it be made available as a driver option now?

tbaeder added inline comments.Jun 2 2023, 6:39 AM
clang/docs/UsersManual.rst
578 ↗(On Diff #522634)

No strong opinion from my side, but would certainly make sense to me.

dyung added a subscriber: dyung.Jun 2 2023, 11:11 PM
dyung added inline comments.
clang/docs/ReleaseNotes.rst
290 ↗(On Diff #522634)

Is this correct? I notice that all of the tests you update used the form of the option "-fcaret-diagnostics-max-lines n" rather than "-fcaret-diagnostics-max-lines=n". When I tried to update some private tests to use "-fcaret-diagnostics-max-lines=n", the compiler did not accept it, but when I changed it to use "-fcaret-diagnostics-max-lines n" it worked. I think we either need to update the option to accept both forms or to correct the documentation.

tbaeder added inline comments.Jun 2 2023, 11:19 PM
clang/docs/ReleaseNotes.rst
290 ↗(On Diff #522634)

Yes, you're right, that should be changed.

@hans @dyung I've opened https://reviews.llvm.org/D152090 which should fix both of your concerns.

hans added a comment.Jun 26 2023, 6:00 AM

I noticed that at least for some cases of -Wformat, the line numbers on the left seem to be off: https://github.com/llvm/llvm-project/issues/63524

I noticed that at least for some cases of -Wformat, the line numbers on the left seem to be off: https://github.com/llvm/llvm-project/issues/63524

That's because DisplayLineNo refers to the line of the diagnostic, not the first source line, and then counts up. What's the reason for DisplayLineNo in the first place? Given the printed line comes from LineNo, surely we should be using LineNo instead?