This is an archive of the discontinued LLVM Phabricator instance.

Paths

clang/include/clang/Driver/Driver.h
clang/include/clang/Driver/Options.td
clang/lib/Driver/Driver.cpp
clang/test/Driver/emit-reproducer.c
clang/tools/driver/driver.cpp

Differential D120201

[Clang] Extend -gen-reproducer flag
ClosedPublic

Authored by abrachet on Feb 19 2022, 8:08 PM.

Details

Reviewers

hans
dblaikie
phosek
aaron.ballman
awarzynski
bruno

Commits

rG7d76d6095880: [Clang] Extend -gen-reproducer flag
rG684c08010876: [Clang] Extend -gen-reproducer flag

Summary

-gen-reproducer causes crash reproduction to be emitted even when clang didn't crash, and now can optionally take an argument of never, on-crash (default), on-error and always.
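
As a rough illustration of the four values listed in the summary, the sketch below maps the flag's argument to a mode. It is not the code from this patch: `ReproducerMode`, `parseReproducerMode`, and `kDefaultReproducerMode` are hypothetical names chosen for the example, and rejecting unknown values by returning an empty optional is an assumption.

```cpp
#include <optional>
#include <string_view>

// Hypothetical enum mirroring the values named in the summary.
enum class ReproducerMode { Never, OnCrash, OnError, Always };

// The summary states that on-crash is the default.
constexpr ReproducerMode kDefaultReproducerMode = ReproducerMode::OnCrash;

// Map the textual argument to a mode. Unknown spellings yield std::nullopt
// so the caller can report a diagnostic (an assumption of this sketch).
std::optional<ReproducerMode> parseReproducerMode(std::string_view Value) {
  if (Value == "never")
    return ReproducerMode::Never;
  if (Value == "on-crash")
    return ReproducerMode::OnCrash;
  if (Value == "on-error")
    return ReproducerMode::OnError;
  if (Value == "always")
    return ReproducerMode::Always;
  return std::nullopt;
}
```

A caller could fall back to `kDefaultReproducerMode` when the flag is absent and emit an error when the function returns an empty optional.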

Event Timeline

abrachet created this revision. Feb 19 2022, 8:08 PM

Herald added a subscriber: dang. Feb 19 2022, 8:08 PM

abrachet requested review of this revision. Feb 19 2022, 8:08 PM

Harbormaster completed remote builds in B150574: Diff 410128. Feb 19 2022, 8:46 PM

dblaikie added a comment. Feb 20 2022, 10:33 AM

This comment was removed by dblaikie.

What's the purpose of this? (it'll need test coverage before it's committed, but some design discussion could probably come before that)

dblaikie added a reviewer: aaron.ballman. Feb 20 2022, 10:34 AM

abrachet edited the summary of this revision. Feb 22 2022, 2:28 PM

In D120201#3334217, @dblaikie wrote:

What's the purpose of this? (it'll need test coverage before it's committed, but some design discussion could probably come before that)

I've tried to give more context in the description, hopefully it elaborates enough.

What other tests would you reckon are necessary here?

In D120201#3334217, @dblaikie wrote:

What's the purpose of this? (it'll need test coverage before it's committed, but some design discussion could probably come before that)

This is something that came up repeatedly when debugging compiler errors that show up in our CI. Currently, that usually requires reproducing the exact build so you can replay the compiler invocation and try various options to debug the issue. That process can take a significant amount of time. In comparison, debugging Clang crashes is much simpler because we automatically upload compiler reproducers to a cloud storage location, so reproducing those is simply a matter of downloading the reproducer and rerunning it locally. This got us thinking: could we make reproducing compiler errors as easy as reproducing compiler crashes? Hence this feature.

In D120201#3338757, @phosek wrote:

In D120201#3334217, @dblaikie wrote:

What's the purpose of this? (it'll need test coverage before it's committed, but some design discussion could probably come before that)

This is something that came up repeatedly when debugging compiler errors that show up in our CI. Currently, that usually requires reproducing the exact build so you can replay the compiler invocation and try various options to debug the issue. That process can take a significant amount of time. In comparison, debugging Clang crashes is much simpler because we automatically upload compiler reproducers to a cloud storage location, so reproducing those is simply a matter of downloading the reproducer and rerunning it locally. This got us thinking: could we make reproducing compiler errors as easy as reproducing compiler crashes? Hence this feature.

I forgot to mention, we would welcome other ideas or suggestions, this is just the direction that made most sense to us based on our experience.

xbolva00 added a subscriber: xbolva00. Feb 22 2022, 2:46 PM

xbolva00 added inline comments.

clang/include/clang/Driver/Options.td
1390	diagnostics
1392	source

Fixed typos

abrachet marked 2 inline comments as done. Feb 22 2022, 3:07 PM

abrachet added inline comments.

clang/include/clang/Driver/Options.td
1390	Thanks!

In D120201#3338763, @phosek wrote:

In D120201#3338757, @phosek wrote:

In D120201#3334217, @dblaikie wrote:

What's the purpose of this? (it'll need test coverage before it's committed, but some design discussion could probably come before that)

This is something that came up repeatedly when debugging compiler errors that show up in our CI. Currently, that usually requires reproducing the exact build so you can replay the compiler invocation and try various options to debug the issue. That process can take a significant amount of time. In comparison, debugging Clang crashes is much simpler because we automatically upload compiler reproducers to a cloud storage location, so reproducing those is simply a matter of downloading the reproducer and rerunning it locally. This got us thinking: could we make reproducing compiler errors as easy as reproducing compiler crashes? Hence this feature.

I forgot to mention, we would welcome other ideas or suggestions, this is just the direction that made most sense to us based on our experience.

Fair enough - yeah, I can see the appeal, though it does feel somewhat awkward. Maybe a more generic tool would be suitable? (I guess you might want to reproduce compilations that don't error too, for instance - how did I get an object file with this feature, why isn't this symbol produced here, etc.) So maybe some sort of --save_temps mode would be suitable?

Harbormaster completed remote builds in B150946: Diff 410651. Feb 22 2022, 3:43 PM

In D120201#3338827, @dblaikie wrote:

In D120201#3338763, @phosek wrote:

In D120201#3338757, @phosek wrote:

In D120201#3334217, @dblaikie wrote:

What's the purpose of this? (it'll need test coverage before it's committed, but some design discussion could probably come before that)

This is something that came up repeatedly when debugging compiler errors that show up in our CI. Currently, that usually requires reproducing the exact build so you can replay the compiler invocation and try various options to debug the issue. That process can take a significant amount of time. In comparison, debugging Clang crashes is much simpler because we automatically upload compiler reproducers to a cloud storage location, so reproducing those is simply a matter of downloading the reproducer and rerunning it locally. This got us thinking: could we make reproducing compiler errors as easy as reproducing compiler crashes? Hence this feature.

I forgot to mention, we would welcome other ideas or suggestions, this is just the direction that made most sense to us based on our experience.

Fair enough - yeah, I can see the appeal, though it does feel somewhat awkward. Maybe a more generic tool would be suitable? (I guess you might want to reproduce compilations that don't error too, for instance - how did I get an object file with this feature, why isn't this symbol produced here, etc.) So maybe some sort of --save_temps mode would be suitable?

I can also see the appeal, but I agree with David... doesn't the same situation arise for warnings as it does for errors? With errors, I can understand "crash on the first error" as being reasonable behavior, but for warnings, I think it's very plausible for users to want to crash on a *specific* warning, but I also think that applies to errors to a lesser extent. So I'm not certain how we extend this for those kinds of use cases. With warnings, we could perhaps find a way to support -fcrash-diagnostics-on-warning=strict-prototypes, but for errors, I don't know how we'd do that. (It could be that we punt on that until later, but if there's a more general design we can figure out up front, that would be better IMO.)

abrachet marked an inline comment as done. Feb 23 2022, 12:52 PM

In D120201#3341270, @aaron.ballman wrote:

I can also see the appeal, but I agree with David... doesn't the same situation arise for warnings as it does for errors? With errors, I can understand "crash on the first error" as being reasonable behavior, but for warnings, I think it's very plausible for users to want to crash on a *specific* warning, but I also think that applies to errors to a lesser extent. So I'm not certain how we extend this for those kinds of use cases. With warnings, we could perhaps find a way to support -fcrash-diagnostics-on-warning=strict-prototypes, but for errors, I don't know how we'd do that. (It could be that we punt on that until later, but if there's a more general design we can figure out up front, that would be better IMO.)

To be clear this doesn't crash on the first error. It just emits diagnostics if the command failed, as if it crashed. To have the same behavior with warnings -Werror should work well enough.

I agree with the two of you that this kind of feature could be designed better, at present it's just a quick way to get these files using an existing mechanism. Do either of you have any ideas on a better direction? lld has a really cool --reproduce flag which writes every file it opened to an output tar file. That would be very useful in clang for reproducing errors, but it's presumably a large undertaking.

In D120201#3341333, @abrachet wrote:

In D120201#3341270, @aaron.ballman wrote:

I can also see the appeal, but I agree with David... doesn't the same situation arise for warnings as it does for errors? With errors, I can understand "crash on the first error" as being reasonable behavior, but for warnings, I think it's very plausible for users to want to crash on a *specific* warning, but I also think that applies to errors to a lesser extent. So I'm not certain how we extend this for those kinds of use cases. With warnings, we could perhaps find a way to support -fcrash-diagnostics-on-warning=strict-prototypes, but for errors, I don't know how we'd do that. (It could be that we punt on that until later, but if there's a more general design we can figure out up front, that would be better IMO.)

To be clear this doesn't crash on the first error. It just emits diagnostics if the command failed, as if it crashed. To have the same behavior with warnings -Werror should work well enough.

I agree with the two of you that this kind of feature could be designed better, at present it's just a quick way to get these files using an existing mechanism. Do either of you have any ideas on a better direction? lld has a really cool --reproduce flag which writes every file it opened to an output tar file. That would be very useful in clang for reproducing errors, but it's presumably a large undertaking.

I'd have thought something like -save-temps could be used? Fleshed out to include the crash script-like command line reproduction, the preprocessed source, etc.

In D120201#3341333, @abrachet wrote:

In D120201#3341270, @aaron.ballman wrote:

I can also see the appeal, but I agree with David... doesn't the same situation arise for warnings as it does for errors? With errors, I can understand "crash on the first error" as being reasonable behavior, but for warnings, I think it's very plausible for users to want to crash on a *specific* warning, but I also think that applies to errors to a lesser extent. So I'm not certain how we extend this for those kinds of use cases. With warnings, we could perhaps find a way to support -fcrash-diagnostics-on-warning=strict-prototypes, but for errors, I don't know how we'd do that. (It could be that we punt on that until later, but if there's a more general design we can figure out up front, that would be better IMO.)

To be clear this doesn't crash on the first error. It just emits diagnostics if the command failed, as if it crashed. To have the same behavior with warnings -Werror should work well enough.

I agree with the two of you that this kind of feature could be designed better, at present it's just a quick way to get these files using an existing mechanism. Do either of you have any ideas on a better direction? lld has a really cool --reproduce flag which writes every file it opened to an output tar file. That would be very useful in clang for reproducing errors, but it's presumably a large undertaking.

Clang has -gen-reproducer option introduced in D27604.

In D120201#3342204, @phosek wrote:

In D120201#3341333, @abrachet wrote:

In D120201#3341270, @aaron.ballman wrote:

I can also see the appeal, but I agree with David... doesn't the same situation arise for warnings as it does for errors? With errors, I can understand "crash on the first error" as being reasonable behavior, but for warnings, I think it's very plausible for users to want to crash on a *specific* warning, but I also think that applies to errors to a lesser extent. So I'm not certain how we extend this for those kinds of use cases. With warnings, we could perhaps find a way to support -fcrash-diagnostics-on-warning=strict-prototypes, but for errors, I don't know how we'd do that. (It could be that we punt on that until later, but if there's a more general design we can figure out up front, that would be better IMO.)

To be clear this doesn't crash on the first error. It just emits diagnostics if the command failed, as if it crashed. To have the same behavior with warnings -Werror should work well enough.

I agree with the two of you that this kind of feature could be designed better, at present it's just a quick way to get these files using an existing mechanism. Do either of you have any ideas on a better direction? lld has a really cool --reproduce flag which writes every file it opened to an output tar file. That would be very useful in clang for reproducing errors, but it's presumably a large undertaking.

Clang has -gen-reproducer option introduced in D27604.

One possible direction would be to extend -gen-reproducer to take a value so you could specify -gen-reproducer=error (or perhaps -gen-reproducer=on-error).

I'm perhaps missing why this is desirable to be "on error" especially - is your use case to have this enabled by default in a distributed build scenario, so end users can locally reproduce the failure for further investigation? (Because the distributed build has too much overhead to iterate efficiently)

I was thinking this was more for compiler developers to investigate - so they could opt into the flag only when investigating (I guess on-error would mean that if they passed the flag to the whole build it would only dump output on the erroring compilations, not every compilation action in the build - is that the issue this is intended to address? Would some more general build feature (Bazel has this, not sure about other build systems) to pass flags only to particular actions be useful for this and other things?)

abrachet updated this revision to Diff 414811. Mar 11 2022, 10:35 PM

abrachet retitled this revision from [Clang] Add -fcrash-diagnostics-on-error flag to [Clang] Add -femit-reproducer flag.

abrachet edited the summary of this revision.

Herald added a project: Restricted Project. Mar 11 2022, 10:35 PM

In D120201#3343509, @dblaikie wrote:

I'm perhaps missing why this is desirable to be "on error" especially - is your use case to have this enabled by default in a distributed build scenario, so end users can locally reproduce the failure for further investigation? (Because the distributed build has too much overhead to iterate efficiently)

I was thinking this was more for compiler developers to investigate - so they could opt into the flag only when investigating (I guess on-error would mean that if they passed the flag to the whole build it would only dump output on the erroring compilations, not every compilation action in the build - is that the issue this is intended to address? Would some more general build feature (Bazel has this, not sure about other build systems) to pass flags only to particular actions be useful for this and other things?)

What do you think about this kind of direction?

Harbormaster completed remote builds in B153893: Diff 414811. Mar 11 2022, 11:06 PM

This direction sounds better to me - could the tar file support be added independently, perhaps as an improvement to the existing crash reproduction infrastructure? (ideally making this new functionality as small/simple as possible - "just give me the same thing the crash reproducer does, because I want it/even if it's not crashing")

+1, useful.

In D120201#3377387, @dblaikie wrote:

This direction sounds better to me - could the tar file support be added independently, perhaps as an improvement to the existing crash reproduction infrastructure?

I'm wary about changing the default behavior of the crash reproduction infrastructure. There's a lot of subtle and platform-specific (mostly Darwin) handling. That said, I think something like this is better than the current crash reproducer. It's more generic and could be used for any crashing job, not just the compiler, and one tar file is a lot easier to upload when submitting a bug than multiple files.

Adding folks who added the Apple specific handling if they want to chime in. @bogner @bruno @t.p.northover

(ideally making this new functionality as small/simple as possible - "just give me the same thing the crash reproducer does, because I want it/even if it's not crashing")

Agreed, this was the initial plan, albeit on error and not always.

In D120201#3343509, @dblaikie wrote:

I'm perhaps missing why this is desirable to be "on error" especially - is your use case to have this enabled by default in a distributed build scenario, so end users can locally reproduce the failure for further investigation? (Because the distributed build has too much overhead to iterate efficiently)

I was thinking this was more for compiler developers to investigate - so they could opt into the flag only when investigating

That was not the initial intention, although where this patch is now, a developer could use -femit-reproducer=always for that purpose.

(I guess on-error would mean that if they passed the flag to the whole build it would only dump output on the erroring compilations, not every compilation action in the build - is that the issue this is intended to address?

Yes, we think this has value specifically when the compilation is happening asynchronously, like on bots, where you don't have the luxury of just rerunning a failed build step with a new flag. Being able to easily get the preprocessed source file is super useful in this case where having to go find all the includes on your own could be a huge pain. We specifically compile our code base with a ToT llvm, for us a lot of errors could be something like libcxx changed some configuration, and being able to see the preprocessed source file in these cases is helpful.

FWIW, most errors are going to be straightforward enough that the error message alone should be enough. But having an opt-in flag you can set globally that only affects failing compilations to help debug why they failed I think has value.

Would some more general build feature (Bazel has this, not sure about other build systems) to pass flags only to particular actions be useful for this and other things?)

Could you expand on this? Does Bazel have a way to rerun a failed action?

In D120201#3377514, @abrachet wrote:

(I guess on-error would mean that if they passed the flag to the whole build it would only dump output on the erroring compilations, not every compilation action in the build - is that the issue this is intended to address?

Yes, we think this has value specifically when the compilation is happening asynchronously, like on bots, where you don't have the luxury of just rerunning a failed build step with a new flag. Being able to easily get the preprocessed source file is super useful in this case where having to go find all the includes on your own could be a huge pain. We specifically compile our code base with a ToT llvm, for us a lot of errors could be something like libcxx changed some configuration, and being able to see the preprocessed source file in these cases is helpful.

FWIW, most errors are going to be straightforward enough that the error message alone should be enough. But having an opt-in flag you can set globally that only affects failing compilations to help debug why they failed I think has value.

To provide another use case, we have bots that cover a large number of build configurations, whereas most developers build only the most common ones locally. Sometimes we see build errors that only impact builders using the more exotic configurations, and we have to replicate the build locally to reproduce the issue, which can take non-trivial effort.

With this feature, we would like to simplify the process by automatically collecting reproducers on errors so developers don't need to replicate the full build in order to reproduce a specific build error.

I don't think this use case is specific only to Fuchsia, the same is true for other projects. In LLVM, we often see build errors only on less common bot configurations and having reproducers available for those cases might help LLVM developers as well.

I like this direction, thank you! I think you should also add a release note for the new functionality.

clang/include/clang/Driver/Options.td
1393

xbolva00 added inline comments. Mar 14 2022, 5:47 AM

clang/tools/driver/driver.cpp
492	Make default value configurable with cmake variable?

In D120201#3377514, @abrachet wrote:

In D120201#3377387, @dblaikie wrote:

This direction sounds better to me - could the tar file support be added independently, perhaps as an improvement to the existing crash reproduction infrastructure?

I'm wary about changing the default behavior of the crash reproduction infrastructure. There's a lot of subtle and platform-specific (mostly Darwin) handling. That said, I think something like this is better than the current crash reproducer. It's more generic and could be used for any crashing job, not just the compiler, and one tar file is a lot easier to upload when submitting a bug than multiple files.

I appreciate that, but would also like to avoid building subtly different-yet-similar functionality (especially for things that are off-by-default and so might be under-exercised if they have their own standalone implementation). I think it's worth trying to reconcile the functionality as much as possible.

Another thing that might be good is if you could work to enable this by default on LLVM buildbots (if there's a practical way to make these reproducers available to developers via the buildbot infrastructure somehow) - that'd add value to LLVM developers and ensure the feature is exercised regularly.

maybe the crash reproducer functionality could be the default here - could have = {none, crash, error, always} and "crash" being the default (& I guess then "error" has to be "error or crash") but maybe that's not helpful - perhaps no one would actually want to turn it off entirely?
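
The ordering suggested here, where "error" means "error or crash", could be sketched as follows. The `Level` and `Outcome` enums and `shouldEmitReproducer` are hypothetical names used for illustration only; they are not taken from the patch.

```cpp
// Hypothetical result of one compilation job.
enum class Outcome { Success, Error, Crash };

// Hypothetical levels, matching the {none, crash, error, always} proposal.
enum class Level { None, Crash, Error, Always };

// Decide whether a reproducer should be written for a finished job.
// Level::Error deliberately covers crashes as well as ordinary errors.
bool shouldEmitReproducer(Level L, Outcome O) {
  switch (L) {
  case Level::None:
    return false;
  case Level::Crash:
    return O == Outcome::Crash;
  case Level::Error:
    return O != Outcome::Success;
  case Level::Always:
    return true;
  }
  return false; // Unreachable for valid enum values.
}
```

With "crash" as the default level, a successful or merely failing compilation writes nothing, which matches the existing crash reproducer behavior described in this thread.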

Adding folks who added the Apple specific handling if they want to chime in. @bogner @bruno @t.p.northover

Yep, happy to get some more perspective for sure.

(ideally making this new functionality as small/simple as possible - "just give me the same thing the crash reproducer does, because I want it/even if it's not crashing")

Agreed, this was the initial plan, albeit on error and not always.

Ah, right.

In D120201#3343509, @dblaikie wrote:

I'm perhaps missing why this is desirable to be "on error" especially - is your use case to have this enabled by default in a distributed build scenario, so end users can locally reproduce the failure for further investigation? (Because the distributed build has too much overhead to iterate efficiently)

I was thinking this was more for compiler developers to investigate - so they could opt into the flag only when investigating

That was not the initial intention, although where this patch is now, a developer could use -femit-reproducer=always for that purpose.

(I guess on-error would mean that if they passed the flag to the whole build it would only dump output on the erroring compilations, not every compilation action in the build - is that the issue this is intended to address?

Yes, we think this has value specifically when the compilation is happening asynchronously, like on bots, where you don't have the luxury of just rerunning a failed build step with a new flag. Being able to easily get the preprocessed source file is super useful in this case where having to go find all the includes on your own could be a huge pain. We specifically compile our code base with a ToT llvm, for us a lot of errors could be something like libcxx changed some configuration, and being able to see the preprocessed source file in these cases is helpful.

FWIW, most errors are going to be straightforward enough that the error message alone should be enough. But having an opt-in flag you can set globally that only affects failing compilations to help debug why they failed I think has value.

Would some more general build feature (Bazel has this, not sure about other build systems) to pass flags only to particular actions be useful for this and other things?)

Could you expand on this? Does Bazel have a way to rerun a failed action?

No, not that I know of - but it does have this: https://docs.bazel.build/versions/main/user-manual.html#flag--per_file_copt - which could be used on a manual rerun. (I assume that most of bazel's custom configuration is done on the bazel command line rather than in a pre-configure step? (at least that's mostly how it works internally) so mostly you could copy the bazel command from the buildbot, add a per_file_copt to dump the reproducer, and reproduce the build locally? but I guess if you could already copy/paste the bazel command then you wouldn't need the reproducer feature... so nevermind that I guess)

abrachet mentioned this in D121725: [clang][test] Add using-lld feature variable. Mar 15 2022, 7:53 PM

It seems like this change is doing two things:

producing reproducer files on any error, not just crashes
producing tar file reproducers

Both of those seem like reasonable features, but I think it would be nice to separate them, and make sure that they integrate into all of the existing crash reproducer functionality, like freproducer-dir (I see it handles this already).

I can imagine that, depending on the build failure mode, producing a pre-processed reproducer may work better than producing a tar file. It would be nice to let the user choose.

The tar file logic doesn't seem complete. The set of files needed to reproduce a command is far more than just the input files listed on the command line.

In D120201#3378354, @phosek wrote:

To provide another use case, we have bots that cover a large number of build configurations, whereas most developers build only the most common ones locally. Sometimes we see build errors that only impact builders using the more exotic configurations, and we have to replicate the build locally to reproduce the issue, which can take non-trivial effort.

With this feature, we would like to simplify the process by automatically collecting reproducers on errors so developers don't need to replicate the full build in order to reproduce a specific build error.

I don't think this use case is specific only to Fuchsia, the same is true for other projects. In LLVM, we often see build errors only on less common bot configurations and having reproducers available for those cases might help LLVM developers as well.

I'm not sure I understand who the user of the reproducer file is here. Are we talking about compiler developers or compiler users? I don't see how a user would have a use case for these reproducer tar files, except perhaps as a way to report non-crash bugs (error-on-valid). This feature mainly seems useful as a way for compiler developers to automatically collect compiler inputs from complex build systems. This could be especially useful when modules are involved.

clang/test/Driver/reproduce.c
3 ↗	(On Diff #414811)	This test seems pretty shell-y. Make sure it passes the Windows presubmit tests.

Emit reproducer tar file unconditionally on crashes.

Herald added a reviewer: awarzynski. Mar 21 2022, 10:23 AM

In D120201#3381444, @dblaikie wrote:

I appreciate that, but would also like to avoid building subtly different-yet-similar functionality (especially for things that are off-by-default and so might be under-exercised if they have their own standalone implementation). I think it's worth trying to reconcile the functionality as much as possible.

Sure, that makes sense. I'm thinking we could do a soft transition. So for now we have the reproducer file included alongside the other files that are currently emitted. Then maybe send out a thread on the mailing list saying that the intention is to remove the individual files in favor of just the tar file in the near future.

Another thing that might be good is if you could work to enable this by default on LLVM buildbots (if there's a practical way to make these reproducers available to developers via the buildbot infrastructure somehow) - that'd add value to LLVM developers and ensure the feature is exercised regularly.

Indeed, that sounds like a good plan, I will look into that.

In D120201#3387159, @rnk wrote:

It seems like this change is doing two things:

producing reproducer files on any error, not just crashes

producing tar file reproducers

Both of those seem like reasonable features, but I think it would be nice to separate them, and make sure that they integrate into all of the existing crash reproducer functionality, like freproducer-dir (I see it handles this already).

I can imagine that, depending on the build failure mode, producing a pre-processed reproducer may work better than producing a tar file. It would be nice to let the user choose.

The tar file logic doesn't seem complete. The set of files needed to reproduce a command is far more than just the input files listed on the command line.

It isn't meant to be a completely comprehensive way to reproduce a compilation action. Libraries implicitly added to the link will of course not be here, but header files will because the preprocessed sources will be emitted.

Is there anything in particular you think is missing?

In D120201#3378354, @phosek wrote:

To provide another use case, we have bots that cover a large number of build configurations, whereas most developers build only the most common ones locally. Sometimes we see build errors that only impact builders using the more exotic configurations, and we have to replicate the build locally to reproduce the issue, which can take non-trivial effort.

With this feature, we would like to simplify the process by automatically collecting reproducers on errors so developers don't need to replicate the full build in order to reproduce a specific build error.

I don't think this use case is specific only to Fuchsia, the same is true for other projects. In LLVM, we often see build errors only on less common bot configurations and having reproducers available for those cases might help LLVM developers as well.

I'm not sure I understand who the user of the reproducer file is here. Are we talking about compiler developers or compiler users? I don't see how a user would have a use case for these reproducer tar files, except perhaps as a way to report non-crash bugs (error-on-valid). This feature mainly seems useful as a way for compiler developers to automatically collect compiler inputs from complex build systems. This could be especially useful when modules are involved.

I think it can be useful for both. Our primary motivation here was to get most of the files necessary to reproduce a failed compilation very easily.

In D120201#3379185, @aaron.ballman wrote:

I like this direction, thank you! I think you should also add a release note for the new functionality.

Ack. I will do this when we get closer to consensus.

clang/test/Driver/reproduce.c
3 ↗	(On Diff #414811)	Thanks. Looks like `stat` was not available.

Harbormaster completed remote builds in B155434: Diff 417004.Mar 21 2022, 11:19 AM

This is pretty cool, I enjoy the idea of getting a tar out of a crash. I'm also a +1 for having this group of behaviors as a more official -femit-reproducer=<option> flag. In future work, do you plan to change the default crash mode to output a tar instead of multiple files?

For this specific patch: it's fine that both -gen-reproducer and FORCE_CLANG_DIAGNOSTICS_CRASH have their own specific meanings and the driver should parse both in terms of -femit-reproducer=always + TURN_OFF_TAR, so I'd prefer if the patch steers a bit more into that direction. More comments inline.

clang/tools/driver/driver.cpp
486	Can you relate this to an enum here already? Perhaps use some bits for the level style and one to track using TAR?
510	This is the same path as `-femit-reproducer=always` minus a few specific things, this path could be unified.
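
One way to read the suggestion above is a small bit-packed mode value: two low bits for the level and a separate bit for the tar output. The following is only a sketch of that idea; `ReproLevel`, `ReproMode`, `makeMode`, `levelOf` and `usesTar` are hypothetical names chosen for illustration and are not identifiers from the patch.

```cpp
#include <cstdint>

// Hypothetical names throughout; this is not the patch's code.
enum class ReproLevel : std::uint8_t { Off = 0, OnCrash = 1, OnError = 2, Always = 3 };

using ReproMode = std::uint8_t;

constexpr ReproMode LevelMask = 0x3; // low two bits hold the level
constexpr ReproMode TarBit = 0x4;    // third bit: emit a single tar file

// Pack a level and the tar flag into one value.
constexpr ReproMode makeMode(ReproLevel Level, bool UseTar) {
  return static_cast<ReproMode>(static_cast<ReproMode>(Level) |
                                (UseTar ? TarBit : ReproMode{0}));
}

// Recover the level from a packed mode.
constexpr ReproLevel levelOf(ReproMode Mode) {
  return static_cast<ReproLevel>(Mode & LevelMask);
}

// Recover the tar flag from a packed mode.
constexpr bool usesTar(ReproMode Mode) { return (Mode & TarBit) != 0; }
```

Under this encoding, the "always, without tar" behaviour described in the comment would correspond to `makeMode(ReproLevel::Always, /*UseTar=*/false)`.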

bruno added a reviewer: bruno.Mar 21 2022, 3:02 PM

I think I've lost track of the variants here - if you're waiting on review feedback, at least for me, it'd be helpful to have a high level description of the state of the patch, what it does/doesn't do in terms of how it interacts with existing crash reporting and what new functionality it exposes - and the planned direction (things you intend to do in a relatively timely fashion, versus "things someone could do at some point in the future/if they were so inclined").

Soft transition has some value, though some risk (that it's never finished and stays in a hybrid state) - I'd rather avoid it if it's not too hard - post something to the forums and see if anyone would be particularly broken by moving to a tar file maybe? But if other folks feel more strongly about a slower roll out, that's OK.

abrachet updated this revision to Diff 417696.Mar 23 2022, 11:18 AM

abrachet marked an inline comment as done.

abrachet edited the summary of this revision.

abrachet added a parent revision: D122335: [clang] Emit crash reproduction as a single tar file.

In D120201#3400286, @dblaikie wrote:

I think I've lost track of the variants here - if you're waiting on review feedback, at least for me, it'd be helpful to have a high level description of the state of the patch, what it does/doesn't do in terms of how it interacts with existing crash reporting and what new functionality it exposes - and the planned direction (things you intend to do in a relatively timely fashion, versus "things someone could do at some point in the future/if they were so inclined").

I've split this patch up, as suggested by @rnk. This patch now just adds the -femit-reproducer= option and D122335 changes crash reproduction into a tar file.

Soft transition has some value, though some risk (that it's never finished and stays in a hybrid state) - I'd rather avoid it if it's not too hard - post something to the forums and see if anyone would be particularly broken by moving to a tar file maybe? But if other folks feel more strongly about a slower roll out, that's OK.

Sure, I've gone with a hard transition and have posted about it on the mailing list https://discourse.llvm.org/t/changing-clangs-crash-reproduction-feature/61171

Harbormaster completed remote builds in B155916: Diff 417696.Mar 23 2022, 11:56 AM

Thanks, I like this approach. I haven't had a chance to do more detailed code review in the last week, if someone else can.

Herald added a subscriber: MaskRay. Mar 29 2022, 1:33 PM

MaskRay added inline comments.Mar 29 2022, 3:04 PM

clang/test/Driver/emit-reproducer.c
25	This is the first time `tar` is used in a test. I am unsure whether all Windows bots will provide it. Be prepared that you may need `UNSUPPORTED: system-windows`.
41	No newline at end of file
clang/tools/driver/driver.cpp
486	Consider `enum class` to not add common names like `Off` `Always` to the class.

hans added inline comments.Mar 30 2022, 1:28 AM

clang/test/Driver/emit-reproducer.c
25	Many lld tests use tar already, also on Windows. (e.g. lld/test/ELF/reproduce-error.s which doesn't have any special requirements), so hopefully it will work for Clang tests too.

D122335 was a requested feature for emitting reproducers as a single tar file, but it seems to have stalled. What do folks think about moving forward with this patch without emitting the reproducer as a tar file?

This patch no longer depends on D122335 which created the reproducer as a tar file. Now this will just emit the crash reproduction information as usual, but under other circumstances given the value of -femit-reproducer.
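
The behaviour described here amounts to a small decision table over the option value and the outcome of the compilation. A minimal sketch, assuming that the on-error level also covers crashes and using hypothetical names (`ReproLevel`, `shouldEmitReproducer`) that are not taken from the patch:

```cpp
// Hypothetical names; the values mirror the option spellings discussed in
// this review (off, on-crash, on-error, always).
enum class ReproLevel { Off, OnCrash, OnError, Always };

// Decide whether a reproducer should be written for one compilation.
// Assumption: "on-error" also fires on a crash, since a crash is the
// stricter kind of failure.
constexpr bool shouldEmitReproducer(ReproLevel Level, bool Crashed,
                                    bool HadError) {
  switch (Level) {
  case ReproLevel::Off:
    return false;
  case ReproLevel::OnCrash:
    return Crashed;
  case ReproLevel::OnError:
    return Crashed || HadError;
  case ReproLevel::Always:
    return true;
  }
  return false; // unreachable for valid enumerators
}
```

With the default level (on-crash), a compilation that merely reports an error produces no reproducer, which matches the existing behaviour the comment refers to.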

abrachet removed a parent revision: D122335: [clang] Emit crash reproduction as a single tar file.May 23 2022, 8:03 AM

Harbormaster completed remote builds in B165844: Diff 431378.May 23 2022, 8:57 AM

In D120201#3531499, @abrachet wrote:

This patch no longer depends on D122335 which created the reproducer as a tar file. Now this will just emit the crash reproduction information as usual, but under other circumstances given the value of -femit-reproducer.

Thanks, I think the current functionality makes sense to me, and the patch seems very focused now.

My only high-level comment is that it seems this overlaps with the existing -gen-reproducer flag and FORCE_CLANG_DIAGNOSTICS_CRASH environment variable. Could the new functionality be implemented as new arguments to the existing flag/variable?

In D120201#3534534, @hans wrote:

In D120201#3531499, @abrachet wrote:

This patch no longer depends on D122335 which created the reproducer as a tar file. Now this will just emit the crash reproduction information as usual, but under other circumstances given the value of -femit-reproducer.

Thanks, I think the current functionality makes sense to me, and the patch seems very focused now.

My only high-level comment is that it seems this overlaps with the existing -gen-reproducer flag and FORCE_CLANG_DIAGNOSTICS_CRASH environment variable. Could the new functionality be implemented as new arguments to the existing flag/variable?

Good call. I've moved away from the new flag in favor of reusing -gen-reproducer for this.

Harbormaster completed remote builds in B166165: Diff 431833.May 24 2022, 5:46 PM

hans added inline comments.May 25 2022, 9:57 AM

clang/include/clang/Driver/Driver.h
483	I think the more typical clang name for this would be maybeGenerateCompilationDiagnostics
clang/include/clang/Driver/Options.td
558	Should we just drop the "on-" prefixes? It makes them a little less clear, but shorter and cleaner (and hyphens in arguments seem unusual).
clang/lib/Driver/Driver.cpp
1174	Would it make sense to move this to driver.cpp with the rest of the logic? That way we could drop the variable too.
clang/tools/driver/driver.cpp
493	Should we reject or at least warn about invalid arguments here?
506	The old code seems to pretend every command failed?
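
For the question about invalid arguments, one option is to have the parser return an empty optional for anything it does not recognise, so the caller can diagnose it rather than silently fall back to a default. This is a sketch only: `ReproLevel` and `parseReproLevel` are hypothetical names, and the accepted spellings are the ones from the HelpText shown in this review, which may differ from what was finally committed.

```cpp
#include <optional>
#include <string_view>

// Hypothetical names; not the patch's code.
enum class ReproLevel { Off, OnCrash, OnError, Always };

// Map an option value to a level. Unknown values yield std::nullopt so the
// caller can reject them or warn, instead of guessing.
constexpr std::optional<ReproLevel> parseReproLevel(std::string_view Value) {
  if (Value == "off")
    return ReproLevel::Off;
  if (Value == "on-crash")
    return ReproLevel::OnCrash;
  if (Value == "on-error")
    return ReproLevel::OnError;
  if (Value == "always")
    return ReproLevel::Always;
  return std::nullopt;
}
```

A caller would check `has_value()` and emit a driver diagnostic for the unrecognised spelling before continuing.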

abrachet updated this revision to Diff 432047.May 25 2022, 10:59 AM

abrachet marked 5 inline comments as done.

abrachet added inline comments.

clang/tools/driver/driver.cpp
506	It later looped through all failing commands; if a command was a crash (which the old code made every command appear to be), it emitted the reproducer and then broke out of the loop. So it essentially only acted as if the first command failed.
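
The control flow described in this reply can be sketched as a loop that stops at the first command treated as a crash. The names below (`FailedCommand`, `pickReproducerCommand`, `ForceCrash`) are hypothetical and only model the described behaviour; they are not the driver's actual types.

```cpp
#include <cstddef>

// Hypothetical stand-in for one entry in the list of failing commands.
struct FailedCommand {
  int ExitCode;
  bool Crashed;
};

// Returns the index of the single command a reproducer is generated for,
// or -1 if none qualifies. When ForceCrash is set, every command is
// treated as a crash, so the first entry always wins.
constexpr int pickReproducerCommand(const FailedCommand *Cmds, std::size_t N,
                                    bool ForceCrash) {
  for (std::size_t I = 0; I < N; ++I) {
    const bool TreatAsCrash = ForceCrash || Cmds[I].Crashed;
    if (TreatAsCrash)
      return static_cast<int>(I); // emit once, then leave the loop
  }
  return -1;
}
```

With `ForceCrash` set, the function never looks past index 0, which is the "only acted as if the first command failed" effect noted above.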

Harbormaster completed remote builds in B166311: Diff 432047.May 25 2022, 12:09 PM

lgtm

clang/test/Driver/emit-reproducer.c
2	I'm not sure if lit handles that semicolon, or if it hands this over to the shell, in which case it won't work on windows. Instead, `rm -rf %t && mkdir %t` seems common among clang tests. Otherwise, this is a nice test file :-)

This revision is now accepted and ready to land.May 27 2022, 8:42 AM

This revision was landed with ongoing or failed builds.May 27 2022, 8:50 AM

Closed by commit rG684c08010876: [Clang] Extend -gen-reproducer flag (authored by abrachet).

This revision was automatically updated to reflect the committed changes.

abrachet added a commit: rG684c08010876: [Clang] Extend -gen-reproducer flag.

Herald added a project: Restricted Project. May 27 2022, 8:50 AM

Herald added a subscriber: cfe-commits.

abrachet marked an inline comment as done.May 27 2022, 8:51 AM

abrachet added inline comments.

clang/test/Driver/emit-reproducer.c
2	Thanks :) updated it in the commit.

Looks like this breaks tests on (at least) Mac and Windows, see e.g. http://45.33.8.238/macm1/36198/step_7.txt (passes on my Linux bot though).

Please take a look and revert for now if it takes a while to fix.

abrachet added a reverting change: rG4dc3893eeb47: Revert "[Clang] Extend -gen-reproducer flag".May 27 2022, 10:04 AM

uabelho added a subscriber: uabelho.May 30 2022, 12:12 AM

Fix tests on macOS and compile test with -fsyntax-only

abrachet updated this revision to Diff 433122.May 31 2022, 10:09 AM

This revision was landed with ongoing or failed builds.May 31 2022, 10:11 AM

abrachet added a commit: rG7d76d6095880: [Clang] Extend -gen-reproducer flag.

The test you added seems to be failing on the PS4 Windows bot. A quick glance seems to suggest that you aren't properly escaping the path separators somewhere. Can you take a look and revert if you need time to investigate?

https://lab.llvm.org/buildbot/#/builders/216/builds/5164

In D120201#3547834, @dyung wrote:

The test you added seems to be failing on the PS4 Windows bot. A quick glance seems to suggest that you aren't properly escaping the path separators somewhere. Can you take a look and revert if you need time to investigate?

https://lab.llvm.org/buildbot/#/builders/216/builds/5164

Should be fixed by https://github.com/llvm/llvm-project/commit/a0ef52cc102504c4282dec7001664ee020396681

In D120201#3547835, @abrachet wrote:

In D120201#3547834, @dyung wrote:

The test you added seems to be failing on the PS4 Windows bot. A quick glance seems to suggest that you aren't properly escaping the path separators somewhere. Can you take a look and revert if you need time to investigate?

https://lab.llvm.org/buildbot/#/builders/216/builds/5164

Should be fixed by https://github.com/llvm/llvm-project/commit/a0ef52cc102504c4282dec7001664ee020396681

Indeed it has, thanks for the quick action!

Harbormaster completed remote builds in B167074: Diff 433122.May 31 2022, 10:53 AM

This patch breaks msan bots: https://lab.llvm.org/buildbot/#/builders/5/builds/24307 and https://lab.llvm.org/buildbot/#/builders/74

https://lab.llvm.org/buildbot/#/builders/5/builds/24335 is (last green build https://lab.llvm.org/buildbot/#/builders/5/builds/24306 + this patch)

FYI @browneee

Revision Contents

Path	Size
clang/include/clang/Driver/Driver.h	7 lines
clang/include/clang/Driver/Options.td	7 lines
clang/lib/Driver/Driver.cpp	87 lines
clang/test/Driver/emit-reproducer.c	40 lines
clang/tools/driver/driver.cpp	35 lines

Diff 417696

clang/include/clang/Driver/Driver.h

Show All 12 Lines
#include "clang/Basic/LLVM.h"		#include "clang/Basic/LLVM.h"
#include "clang/Driver/Action.h"		#include "clang/Driver/Action.h"
#include "clang/Driver/InputInfo.h"		#include "clang/Driver/InputInfo.h"
#include "clang/Driver/Options.h"		#include "clang/Driver/Options.h"
#include "clang/Driver/Phases.h"		#include "clang/Driver/Phases.h"
#include "clang/Driver/ToolChain.h"		#include "clang/Driver/ToolChain.h"
#include "clang/Driver/Types.h"		#include "clang/Driver/Types.h"
#include "clang/Driver/Util.h"		#include "clang/Driver/Util.h"
		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/Option/Arg.h"		#include "llvm/Option/Arg.h"
#include "llvm/Option/ArgList.h"		#include "llvm/Option/ArgList.h"
#include "llvm/Support/StringSaver.h"		#include "llvm/Support/StringSaver.h"

#include <list>		#include <list>
#include <map>		#include <map>
▲ Show 20 Lines • Show All 432 Lines • ▼ Show 20 Lines	public:
/// generateCompilationDiagnostics - Generate diagnostics information		/// generateCompilationDiagnostics - Generate diagnostics information
/// including preprocessed source file(s).		/// including preprocessed source file(s).
///		///
void generateCompilationDiagnostics(		void generateCompilationDiagnostics(
Compilation &C, const Command &FailingCommand,		Compilation &C, const Command &FailingCommand,
StringRef AdditionalInformation = "",		StringRef AdditionalInformation = "",
CompilationDiagnosticReport *GeneratedReport = nullptr);		CompilationDiagnosticReport *GeneratedReport = nullptr);

		llvm::Optional<std::string>
		generateReproducerFile(Compilation &C,
		const Command *FailingCommand = nullptr,
		StringRef AdditionalInformation = "",
		CompilationDiagnosticReport *Report = nullptr);

/// @}		/// @}
/// @name Helper Methods		/// @name Helper Methods
/// @{		/// @{

/// PrintActions - Print the list of actions.		/// PrintActions - Print the list of actions.
void PrintActions(const Compilation &C) const;		void PrintActions(const Compilation &C) const;

/// PrintHelp - Print the help text.		/// PrintHelp - Print the help text.
		hans: I think the more typical clang name for this would be maybeGenerateCompilationDiagnostics
///		///
/// \param ShowHidden - Show hidden options.		/// \param ShowHidden - Show hidden options.
void PrintHelp(bool ShowHidden) const;		void PrintHelp(bool ShowHidden) const;

/// PrintVersion - Print the driver version.		/// PrintVersion - Print the driver version.
void PrintVersion(const Compilation &C, raw_ostream &OS) const;		void PrintVersion(const Compilation &C, raw_ostream &OS) const;

/// GetFilePath - Lookup \p Name in the list of file search paths.		/// GetFilePath - Lookup \p Name in the list of file search paths.
▲ Show 20 Lines • Show All 202 Lines • Show Last 20 Lines

clang/include/clang/Driver/Options.td


Show First 20 Lines • Show All 549 Lines • ▼ Show 20 Lines

def ccc_arcmt_migrate : Separate<["-"], "ccc-arcmt-migrate">, InternalDriverOpt,

HelpText<"Apply modifications and produces temporary files that conform to ARC">;

def arcmt_migrate_report_output : Separate<["-"], "arcmt-migrate-report-output">,

HelpText<"Output path for the plist report">, Flags<[CC1Option]>,

MarshallingInfoString<FrontendOpts<"ARCMTMigrateReportOut">>;

def arcmt_migrate_emit_arc_errors : Flag<["-"], "arcmt-migrate-emit-errors">,

HelpText<"Emit ARC errors even if the migrator can fix them">, Flags<[CC1Option]>,

MarshallingInfoFlag<FrontendOpts<"ARCMTMigrateEmitARCErrors">>;

def gen_reproducer: Flag<["-"], "gen-reproducer">, InternalDebugOpt,

HelpText<"Auto-generates preprocessed source files and a reproduction script">;

hans: Should we just drop the "on-" prefixes? It makes them a little less clear, but shorter and cleaner (and hyphens in arguments seem unusual).

def gen_cdb_fragment_path: Separate<["-"], "gen-cdb-fragment-path">, InternalDebugOpt,

HelpText<"Emit a compilation database fragment to the specified directory">;

def round_trip_args : Flag<["-"], "round-trip-args">, Flags<[CC1Option, NoDriverOption]>,

HelpText<"Enable command line arguments round-trip.">;

def no_round_trip_args : Flag<["-"], "no-round-trip-args">, Flags<[CC1Option, NoDriverOption]>,

HelpText<"Disable command line arguments round-trip.">;

▲ Show 20 Lines • Show All 812 Lines • ▼ Show 20 Lines

def fconstant_string_class_EQ : Joined<["-"], "fconstant-string-class=">, Group<f_Group>;

def fconstexpr_depth_EQ : Joined<["-"], "fconstexpr-depth=">, Group<f_Group>;

def fconstexpr_steps_EQ : Joined<["-"], "fconstexpr-steps=">, Group<f_Group>;

def fexperimental_new_constant_interpreter : Flag<["-"], "fexperimental-new-constant-interpreter">, Group<f_Group>,

HelpText<"Enable the experimental new constant interpreter">, Flags<[CC1Option]>,

MarshallingInfoFlag<LangOpts<"EnableNewConstInterp">>;

def fconstexpr_backtrace_limit_EQ : Joined<["-"], "fconstexpr-backtrace-limit=">,

Group<f_Group>;

def fno_crash_diagnostics : Flag<["-"], "fno-crash-diagnostics">, Group<f_clang_Group>, Flags<[NoArgumentUnused, CoreOption]>,

HelpText<"Disable auto-generation of preprocessed source files and a script for reproduction during a clang crash">;

def fcrash_diagnostics_dir : Joined<["-"], "fcrash-diagnostics-dir=">,

Group<f_clang_Group>, Flags<[NoArgumentUnused, CoreOption]>,

HelpText<"Put crash-report files in <dir>">, MetaVarName<"<dir>">;

def femit_reproducer : Joined<["-"], "femit-reproducer=">, Group<f_clang_Group>, Flags<[NoArgumentUnused, CoreOption]>,

xbolva00: diagnostics

abrachet: Thanks!

HelpText<"Emit reproducer on (option: off, on-crash (default), on-error, always)">;

def fno_crash_diagnostics : Flag<["-"], "fno-crash-diagnostics">, Group<f_clang_Group>, Flags<[NoArgumentUnused, CoreOption]>,

xbolva00: source

Alias<femit_reproducer>, AliasArgs<["off"]>,

aaron.ballman:

def femit_reproducer : Joined<["-"], "femit-reproducer=">, Group<f_clang_Group>, Flags<[NoArgumentUnused, CoreOption]>,

- HelpText<"Emit reproducer on (option: off (default), error, always)">;

+ HelpText<"Emit reproducer on (option: off (default), on-error, always)">;

def fcreate_profile : Flag<["-"], "fcreate-profile">, Group<f_Group>;

HelpText<"Disable auto-generation of preprocessed source files and a script for reproduction during a clang crash">;

def fcreate_profile : Flag<["-"], "fcreate-profile">, Group<f_Group>;

defm cxx_exceptions: BoolFOption<"cxx-exceptions",

LangOpts<"CXXExceptions">, DefaultFalse,

PosFlag<SetTrue, [CC1Option], "Enable C++ exceptions">, NegFlag<SetFalse>>;

defm async_exceptions: BoolFOption<"async-exceptions",

LangOpts<"EHAsynch">, DefaultFalse,

PosFlag<SetTrue, [CC1Option], "Enable EH Asynchronous exceptions">, NegFlag<SetFalse>>;

defm cxx_modules : BoolFOption<"cxx-modules",

▲ Show 20 Lines • Show All 5,216 Lines • Show Last 20 Lines

clang/lib/Driver/Driver.cpp

Show First 20 Lines • Show All 1,165 Lines • ▼ Show 20 Lines	Compilation *Driver::BuildCompilation(ArrayRef<const char *> ArgList) {
// options, either by introducing new ones or by overloading gcc ones like -V		// options, either by introducing new ones or by overloading gcc ones like -V
// or -b.		// or -b.
CCCPrintPhases = Args.hasArg(options::OPT_ccc_print_phases);		CCCPrintPhases = Args.hasArg(options::OPT_ccc_print_phases);
CCCPrintBindings = Args.hasArg(options::OPT_ccc_print_bindings);		CCCPrintBindings = Args.hasArg(options::OPT_ccc_print_bindings);
if (const Arg *A = Args.getLastArg(options::OPT_ccc_gcc_name))		if (const Arg *A = Args.getLastArg(options::OPT_ccc_gcc_name))
CCCGenericGCCName = A->getValue();		CCCGenericGCCName = A->getValue();
GenReproducer = Args.hasFlag(options::OPT_gen_reproducer,		GenReproducer = Args.hasFlag(options::OPT_gen_reproducer,
options::OPT_fno_crash_diagnostics,		options::OPT_fno_crash_diagnostics,
!!::getenv("FORCE_CLANG_DIAGNOSTICS_CRASH"));		!!::getenv("FORCE_CLANG_DIAGNOSTICS_CRASH"));
		hans: Would it make sense to move this to driver.cpp with the rest of the logic? That way we could drop the variable too.

// Process -fproc-stat-report options.		// Process -fproc-stat-report options.
if (const Arg *A = Args.getLastArg(options::OPT_fproc_stat_report_EQ)) {		if (const Arg *A = Args.getLastArg(options::OPT_fproc_stat_report_EQ)) {
CCPrintProcessStats = true;		CCPrintProcessStats = true;
CCPrintStatReportFilename = A->getValue();		CCPrintStatReportFilename = A->getValue();
}		}
if (Args.hasArg(options::OPT_fproc_stat_report))		if (Args.hasArg(options::OPT_fproc_stat_report))
CCPrintProcessStats = true;		CCPrintProcessStats = true;
▲ Show 20 Lines • Show All 232 Lines • ▼ Show 20 Lines	openReproTarFile(Driver &D, Compilation &C) {
if (!TarWriterOrErr) {		if (!TarWriterOrErr) {
D.Diag(clang::diag::err_unable_to_make_temp) << TmpName;		D.Diag(clang::diag::err_unable_to_make_temp) << TmpName;
return {};		return {};
}		}

return {std::move(*TarWriterOrErr), TmpName.c_str()};		return {std::move(*TarWriterOrErr), TmpName.c_str()};
}		}

// When clang crashes, produce diagnostic information including the fully		llvm::Optional<std::string>
// preprocessed source file(s). Request that the developer attach the		Driver::generateReproducerFile(Compilation &C, const Command *FailingCommand,
// diagnostic information to a bug report.		StringRef AdditionalInformation,
void Driver::generateCompilationDiagnostics(		CompilationDiagnosticReport *Report) {
Compilation &C, const Command &FailingCommand,
StringRef AdditionalInformation, CompilationDiagnosticReport *Report) {
if (C.getArgs().hasArg(options::OPT_fno_crash_diagnostics))
return;

// Don't try to generate diagnostics for link or dsymutil jobs.
if (FailingCommand.getCreator().isLinkJob() ||
FailingCommand.getCreator().isDsymutilJob())
return;

// Print the version of the compiler.
PrintVersion(C, llvm::errs());

// Suppress driver output and emit preprocessor output to temp file.		// Suppress driver output and emit preprocessor output to temp file.
CCGenDiagnostics = true;		CCGenDiagnostics = true;

// Save the original job command(s).		// Save the original job command(s).
Command Cmd = FailingCommand;		std::unique_ptr<Command> Cmd;
		if (FailingCommand)
		Cmd = std::make_unique<Command>(*FailingCommand);

// Keep track of whether we produce any errors while trying to produce		// Keep track of whether we produce any errors while trying to produce
// preprocessed sources.		// preprocessed sources.
DiagnosticErrorTrap Trap(Diags);		DiagnosticErrorTrap Trap(Diags);

// Suppress tool output.		// Suppress tool output.
C.initCompilationForDiagnostics();		C.initCompilationForDiagnostics();

Show All 22 Lines	if (IgnoreInput) {
++it;		++it;
}		}
}		}

if (Inputs.empty()) {		if (Inputs.empty()) {
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Error generating preprocessed source(s) - "		<< "Error generating preprocessed source(s) - "
"no preprocessable inputs.";		"no preprocessable inputs.";
return;		return {};
}		}

// Don't attempt to generate preprocessed files if multiple -arch options are		// Don't attempt to generate preprocessed files if multiple -arch options are
// used, unless they're all duplicates.		// used, unless they're all duplicates.
llvm::StringSet<> ArchNames;		llvm::StringSet<> ArchNames;
for (const Arg *A : C.getArgs()) {		for (const Arg *A : C.getArgs()) {
if (A->getOption().matches(options::OPT_arch)) {		if (A->getOption().matches(options::OPT_arch)) {
StringRef ArchName = A->getValue();		StringRef ArchName = A->getValue();
ArchNames.insert(ArchName);		ArchNames.insert(ArchName);
}		}
}		}
if (ArchNames.size() > 1) {		if (ArchNames.size() > 1) {
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Error generating preprocessed source(s) - cannot generate "		<< "Error generating preprocessed source(s) - cannot generate "
"preprocessed source with multiple -arch options.";		"preprocessed source with multiple -arch options.";
return;		return {};
}		}

// Construct the list of abstract actions to perform for this compilation. On		// Construct the list of abstract actions to perform for this compilation. On
// Darwin OSes this uses the driver-driver and builds universal actions.		// Darwin OSes this uses the driver-driver and builds universal actions.
const ToolChain &TC = C.getDefaultToolChain();		const ToolChain &TC = C.getDefaultToolChain();
if (TC.getTriple().isOSBinFormatMachO())		if (TC.getTriple().isOSBinFormatMachO())
BuildUniversalActions(C, TC, Inputs);		BuildUniversalActions(C, TC, Inputs);
else		else
BuildActions(C, C.getArgs(), Inputs, C.getActions());		BuildActions(C, C.getArgs(), Inputs, C.getActions());

BuildJobs(C);		BuildJobs(C);

// If there were errors building the compilation, quit now.		// If there were errors building the compilation, quit now.
if (Trap.hasErrorOccurred()) {		if (Trap.hasErrorOccurred()) {
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Error generating preprocessed source(s).";		<< "Error generating preprocessed source(s).";
return;		return {};
}		}

// Generate preprocessed output.		// Generate preprocessed output.
SmallVector<std::pair<int, const Command *>, 4> FailingCommands;		SmallVector<std::pair<int, const Command *>, 4> FailingCommands;
C.ExecuteJobs(C.getJobs(), FailingCommands);		C.ExecuteJobs(C.getJobs(), FailingCommands);

// If any of the preprocessing commands failed, clean up and exit.		// If any of the preprocessing commands failed, clean up and exit.
if (!FailingCommands.empty()) {		if (!FailingCommands.empty()) {
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Error generating preprocessed source(s).";		<< "Error generating preprocessed source(s).";
return;		return {};
}		}

const ArgStringList &TempFiles = C.getTempFiles();		const ArgStringList &TempFiles = C.getTempFiles();
if (TempFiles.empty()) {		if (TempFiles.empty()) {
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Error generating preprocessed source(s).";		<< "Error generating preprocessed source(s).";
return;		return {};
}		}

std::unique_ptr<llvm::TarWriter> TarWriter;		std::unique_ptr<llvm::TarWriter> TarWriter;
std::string ReproFileName;		std::string ReproFileName;
std::tie(TarWriter, ReproFileName) = openReproTarFile(*this, C);		std::tie(TarWriter, ReproFileName) = openReproTarFile(*this, C);
if (!TarWriter) {		if (!TarWriter) {
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Error generating preprocessed source(s). - cannot open tar file";		<< "Error generating preprocessed source(s). - cannot open tar file";
return;		return {};
}		}

std::function<void(StringRef, std::string)> WriteFileToTar =		std::function<void(StringRef, std::string)> WriteFileToTar =
[&WriteFileToTar, &TarWriter](StringRef FSPath, std::string TarPath) {		[&WriteFileToTar, &TarWriter](StringRef FSPath, std::string TarPath) {
if (llvm::sys::fs::is_directory(FSPath)) {		if (llvm::sys::fs::is_directory(FSPath)) {
using llvm::sys::fs::recursive_directory_iterator;		using llvm::sys::fs::recursive_directory_iterator;
std::error_code EC;		std::error_code EC;
for (recursive_directory_iterator I{FSPath, EC}, E; I != E && !EC;		for (recursive_directory_iterator I{FSPath, EC}, E; I != E && !EC;
▲ Show 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	Driver::generateReproducerFile(Compilation &C, const Command *FailingCommand,
CrashReportInfo CrashInfo(TempFiles[0], VFS);		CrashReportInfo CrashInfo(TempFiles[0], VFS);

llvm::SmallString<128> Script(CrashInfo.Filename);		llvm::SmallString<128> Script(CrashInfo.Filename);
std::string Storage;		std::string Storage;
llvm::raw_string_ostream ScriptOS{Storage};		llvm::raw_string_ostream ScriptOS{Storage};
ScriptOS << "# Crash reproducer for " << getClangFullVersion() << "\n"		ScriptOS << "# Crash reproducer for " << getClangFullVersion() << "\n"
<< "# Driver args: ";		<< "# Driver args: ";
printArgList(ScriptOS, C.getInputArgs());		printArgList(ScriptOS, C.getInputArgs());
		if (Cmd) {
ScriptOS << "# Original command: ";		ScriptOS << "# Original command: ";
Cmd.Print(ScriptOS, "\n", /*Quote=*/true);		Cmd->Print(ScriptOS, "\n", /*Quote=*/true);
		Cmd->Print(ScriptOS, "\n", /*Quote=*/true, &CrashInfo);
		} else {
		for (auto Cmd : C.getJobs()) {
		ScriptOS << "# " << Cmd.getCreator().getName();
Cmd.Print(ScriptOS, "\n", /*Quote=*/true, &CrashInfo);		Cmd.Print(ScriptOS, "\n", /*Quote=*/true, &CrashInfo);
		}
		}
if (!AdditionalInformation.empty())		if (!AdditionalInformation.empty())
ScriptOS << "\n# Additional information: " << AdditionalInformation << "\n";		ScriptOS << "\n# Additional information: " << AdditionalInformation << "\n";
if (Report)		if (Report)
Report->TemporaryFiles.push_back(std::string(Script.str()));		Report->TemporaryFiles.push_back(std::string(Script.str()));
TarWriter->append("repro.sh", Storage);		TarWriter->append("repro.sh", Storage);

Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "\n********************\n\n"
"PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:\n"
"Preprocessed source(s) and associated run script(s) are located at:";

Diag(clang::diag::note_drv_command_failed_diag_msg) << ReproFileName;

// On darwin, provide information about the .crash diagnostic report.		// On darwin, provide information about the .crash diagnostic report.
if (llvm::Triple(llvm::sys::getProcessTriple()).isOSDarwin()) {		if (llvm::Triple(llvm::sys::getProcessTriple()).isOSDarwin()) {
SmallString<128> CrashDiagDir;		SmallString<128> CrashDiagDir;
if (getCrashDiagnosticFile(ReproCrashFilename, CrashDiagDir)) {		if (getCrashDiagnosticFile(ReproCrashFilename, CrashDiagDir)) {
WriteFileToTar(ReproCrashFilename.str(), "");		WriteFileToTar(ReproCrashFilename.str(), "");
} else { // Suggest a directory for the user to look for .crash files.		} else { // Suggest a directory for the user to look for .crash files.
llvm::sys::path::append(CrashDiagDir, Name);		llvm::sys::path::append(CrashDiagDir, Name);
CrashDiagDir += "_<YYYY-MM-DD-HHMMSS>_<hostname>.crash";		CrashDiagDir += "_<YYYY-MM-DD-HHMMSS>_<hostname>.crash";
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Crash backtrace is located in";		<< "Crash backtrace is located in";
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< CrashDiagDir.str();		<< CrashDiagDir.str();
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "(choose the .crash file that corresponds to your crash)";		<< "(choose the .crash file that corresponds to your crash)";
}		}
}		}

		return ReproFileName;
		}

		// When clang crashes, produce diagnostic information including the fully
		// preprocessed source file(s). Request that the developer attach the
		// diagnostic information to a bug report.
		void Driver::generateCompilationDiagnostics(
		Compilation &C, const Command &FailingCommand,
		StringRef AdditionalInformation, CompilationDiagnosticReport *Report) {
		if (C.getArgs().hasArg(options::OPT_fno_crash_diagnostics))
		return;

		// Don't try to generate diagnostics for link or dsymutil jobs.
		if (FailingCommand.getCreator().isLinkJob() ||
		FailingCommand.getCreator().isDsymutilJob())
		return;

		// Print the version of the compiler.
		PrintVersion(C, llvm::errs());

		Diag(clang::diag::note_drv_command_failed_diag_msg)
		<< "\n********************\n\n"
		"PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:\n"
		"Preprocessed source(s) and associated run script(s) are located at:";

		llvm::Optional<std::string> ReproFileName =
		generateReproducerFile(C, &FailingCommand, AdditionalInformation, Report);
		if (ReproFileName)
		Diag(clang::diag::note_drv_command_failed_diag_msg) << *ReproFileName;

Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "\n\n********************";		<< "\n\n********************";
}		}

void Driver::setUpResponseFiles(Compilation &C, Command &Cmd) {		void Driver::setUpResponseFiles(Compilation &C, Command &Cmd) {
// Since commandLineFitsWithinSystemLimits() may underestimate system's		// Since commandLineFitsWithinSystemLimits() may underestimate system's
// capacity if the tool does not support response files, there is a chance/		// capacity if the tool does not support response files, there is a chance/
// that things will just work without a response file, so we silently just		// that things will just work without a response file, so we silently just
[... 4,359 lines not shown ...]

clang/test/Driver/emit-reproducer.c

This file was added.

				// RUN: rm -rf %t
				// RUN: mkdir %t
				hans (Done): I'm not sure if lit handles that semicolon, or if it hands this over to the shell, in which case it won't work on Windows. Instead, `rm -rf %t && mkdir %t` seems common among clang tests. Otherwise, this is a nice test file :-)
				abrachet (Author, Done): Thanks :) updated it in the commit.

				// Check that reproducers aren't created when -femit-reproducer=off
				// RUN: not %clang -DBODY=error %s -femit-reproducer=off -fcrash-diagnostics-dir=%t
				// RUN: test ! -e %t/*.tar

				// Check that reproducers aren't created when -femit-reproducer isn't specified
				// and the compiler didn't crash.
				// RUN: not %clang -DBODY=error %s -fcrash-diagnostics-dir=%t
				// RUN: test ! -e %t/*.tar

				// Check that reproducers aren't created when no error occurred
				// RUN: %clang %s -femit-reproducer=off -fcrash-diagnostics-dir=%t
				// RUN: test ! -e %t/*.tar
				// RUN: %clang %s -femit-reproducer=on-error -fcrash-diagnostics-dir=%t
				// RUN: test ! -e %t/*.tar

				// Check reproducers are created when an error occurred
				// RUN: not %clang -DBODY=error %s -femit-reproducer=on-error -fcrash-diagnostics-dir=%t
				// RUN: test -e %t/*.tar
				// RUN: rm -f %t/*.tar

				// RUN: %clang %s -femit-reproducer=always -fcrash-diagnostics-dir=%t
				// RUN: tar tf %t/*.tar | sort | FileCheck %s --check-prefix=TAR-LAYOUT
				MaskRay (Done): This is the first time `tar` is used in a test. I am unsure whether all Windows bots will provide it. Be prepared that you may need `UNSUPPORTED: system-windows`.
				hans (Done): Many lld tests use tar already, also on Windows (e.g. lld/test/ELF/reproduce-error.s, which doesn't have any special requirements), so hopefully it will work for Clang tests too.
				// RUN: tar xOf %t/*.tar --wildcards "*/input/emit-reproducer.c" > %t.c
				// RUN: diff %t.c %s
				// RUN: rm -f %t/*.tar

				// TAR-LAYOUT: {{.*}}/input/emit-reproducer.c
				// TAR-LAYOUT-NEXT: {{.*}}/repro.sh
				// TAR-LAYOUT-NEXT: {{.*}}/tmp/emit-reproducer-{{.*}}.c

				#ifndef BODY
				#define BODY
				#endif

				int main() {
				BODY;
				}
				No newline at end of file
				MaskRay (Done): No newline at end of file

clang/tools/driver/driver.cpp

[... 476 lines not shown ...]	int main(int Argc, const char **Argv) {

if (!UseNewCC1Process) {		if (!UseNewCC1Process) {
TheDriver.CC1Main = &ExecuteCC1Tool;		TheDriver.CC1Main = &ExecuteCC1Tool;
// Ensure the CC1Command actually catches cc1 crashes		// Ensure the CC1Command actually catches cc1 crashes
llvm::CrashRecoveryContext::Enable();		llvm::CrashRecoveryContext::Enable();
}		}

std::unique_ptr<Compilation> C(TheDriver.BuildCompilation(Args));		std::unique_ptr<Compilation> C(TheDriver.BuildCompilation(Args));

		enum ReproLevel {
		bruno (Done): Can you relate this to an enum here already? Perhaps use some bits for the level style and one to track using TAR?
		MaskRay (Done): Consider `enum class` to not add common names like `Off` `Always` to the class.
		Off,
		OnCrash,
		OnError,
		Always,
		};

		xbolva00 (Not Done): Make default value configurable with cmake variable?
		ReproLevel ReproLvl = OnCrash;
		hans (Done): Should we reject or at least warn about invalid arguments here?
		bool ReproEmitted = false;
		if (Arg *A = C->getArgs().getLastArg(options::OPT_femit_reproducer))
		ReproLvl = llvm::StringSwitch<ReproLevel>(A->getValue())
		.Case("off", Off)
		.Case("on-crash", OnCrash)
		.Case("on-error", OnError)
		.Case("always", Always)
		.Default(OnCrash);
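
hans's question about invalid arguments and MaskRay's `enum class` suggestion can be illustrated together. The following is a minimal standalone sketch, not the code that was committed: it uses only the standard library instead of `llvm::StringSwitch`, and the function name `parseReproLevel` is hypothetical. The accepted spellings are the ones shown in this diff.

```cpp
#include <optional>
#include <string_view>

// Stand-in for the ReproLevel enum in the diff, written as an enum class so
// that common names like Off and Always stay scoped.
enum class ReproLevel { Off, OnCrash, OnError, Always };

// Hypothetical helper: returns std::nullopt for an unrecognized value so the
// caller can reject or warn instead of silently falling back to OnCrash.
std::optional<ReproLevel> parseReproLevel(std::string_view Value) {
  if (Value == "off")
    return ReproLevel::Off;
  if (Value == "on-crash")
    return ReproLevel::OnCrash;
  if (Value == "on-error")
    return ReproLevel::OnError;
  if (Value == "always")
    return ReproLevel::Always;
  return std::nullopt;
}
```

A caller would keep the `OnCrash` default when the flag is absent and report a driver diagnostic when the flag is present but the helper returns `std::nullopt`.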

int Res = 1;		int Res = 1;
bool IsCrash = false;		bool IsCrash = false;
if (C && !C->containsError()) {		if (C && !C->containsError()) {
SmallVector<std::pair<int, const Command *>, 4> FailingCommands;		SmallVector<std::pair<int, const Command *>, 4> FailingCommands;
		hans (Done): The old code seems to pretend every command failed?
		abrachet (Author, Done): It later looped through all the failing commands. Because it had made every command look like a crash, it emitted the reproducer for the first one and then broke out of the loop. So it essentially only acted as if the first command failed.
Res = TheDriver.ExecuteCompilation(*C, FailingCommands);		Res = TheDriver.ExecuteCompilation(*C, FailingCommands);

// Force a crash to test the diagnostics.		// Force a crash to test the diagnostics.
if (TheDriver.GenReproducer) {		if (TheDriver.GenReproducer) {
		bruno (Done): This is the same path as `-femit-reproducer=always` minus a few specific things, this path could be unified.
Diags.Report(diag::err_drv_force_crash)		Diags.Report(diag::err_drv_force_crash)
<< !::getenv("FORCE_CLANG_DIAGNOSTICS_CRASH");		<< !::getenv("FORCE_CLANG_DIAGNOSTICS_CRASH");

// Pretend that every command failed.		// Pretend that every command failed.
FailingCommands.clear();		FailingCommands.clear();
for (const auto &J : C->getJobs())		for (const auto &J : C->getJobs())
if (const Command *C = dyn_cast<Command>(&J))		if (const Command *C = dyn_cast<Command>(&J))
FailingCommands.push_back(std::make_pair(-1, C));		FailingCommands.push_back(std::make_pair(-1, C));
[... 20 lines not shown ...]
#endif		#endif
#if LLVM_ON_UNIX		#if LLVM_ON_UNIX
// When running in integrated-cc1 mode, the CrashRecoveryContext returns		// When running in integrated-cc1 mode, the CrashRecoveryContext returns
// the same codes as if the program crashed. See section "Exit Status for		// the same codes as if the program crashed. See section "Exit Status for
// Commands":		// Commands":
// https://pubs.opengroup.org/onlinepubs/9699919799/xrat/V4_xcu_chap02.html		// https://pubs.opengroup.org/onlinepubs/9699919799/xrat/V4_xcu_chap02.html
IsCrash |= CommandRes > 128;		IsCrash |= CommandRes > 128;
#endif		#endif
if (IsCrash) {		if (IsCrash && ReproLvl != Off) {
TheDriver.generateCompilationDiagnostics(C, FailingCommand);		TheDriver.generateCompilationDiagnostics(C, FailingCommand);
		ReproEmitted = true;
break;		break;
		} else if (ReproLvl >= OnError) {
		llvm::Optional<std::string> ReproFile =
		TheDriver.generateReproducerFile(*C);
		if (!ReproFile)
		break;
		llvm::errs() << "Reproducer file emitted in: " << *ReproFile << '\n';
		ReproEmitted = true;
		}
}		}
}		}

		if (ReproLvl == Always && !ReproEmitted) {
		llvm::Optional<std::string> ReproFile =
		TheDriver.generateReproducerFile(*C);
		if (ReproFile)
		llvm::errs() << "Reproducer file emitted in: " << *ReproFile << '\n';
}		}
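
The policy that the loop and the trailing `Always` check implement can be summarized as a single predicate. This is a simplified sketch under stated assumptions, not the driver code: `Outcome` and `shouldEmitReproducer` are hypothetical names, and the sketch ignores details such as the early `break` and the fact that the non-crash branch in the diff may run once per failing command.

```cpp
// Stand-in for the ReproLevel enum in the diff.
enum class ReproLevel { Off, OnCrash, OnError, Always };

// Hypothetical summary of a finished compilation.
struct Outcome {
  bool Crashed; // at least one command crashed (IsCrash in the diff)
  bool Failed;  // at least one command failed without crashing
};

// Returns true when a reproducer would be written for the given level.
bool shouldEmitReproducer(ReproLevel Level, const Outcome &O) {
  switch (Level) {
  case ReproLevel::Off:
    return false;
  case ReproLevel::OnCrash:
    return O.Crashed;
  case ReproLevel::OnError:
    return O.Crashed || O.Failed;
  case ReproLevel::Always:
    return true;
  }
  return false; // unreachable for valid enumerators
}
```

Read this way, each level is a superset of the one before it, which is what the `ReproLvl >= OnError` comparison in the diff relies on.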

Diags.getClient()->finish();		Diags.getClient()->finish();

if (!UseNewCC1Process && IsCrash) {		if (!UseNewCC1Process && IsCrash) {
// When crashing in -fintegrated-cc1 mode, bury the timer pointers, because		// When crashing in -fintegrated-cc1 mode, bury the timer pointers, because
// the internal linked list might point to already released stack frames.		// the internal linked list might point to already released stack frames.
llvm::BuryPointer(llvm::TimerGroup::aquireDefaultGroup());		llvm::BuryPointer(llvm::TimerGroup::aquireDefaultGroup());
[... 19 lines not shown ...]


Diff 417696