This is an archive of the discontinued LLVM Phabricator instance.

[llvm-stress] Add a mutation fuzzing mode
Needs ReviewPublic

Authored by reames on Dec 16 2022, 11:24 AM.

Details

Summary

This change generalizes llvm-stress by adding a sub-command to mutate an input IR file. This involves splitting the existing functionality into a sub-command of it's own (though its still the default).

The main change for the existing behavior is the following:

  1. An optional "generate" command is added. You can continue to not specify a sub-command, and we default to "generate" (i.e. legacy behavior.)
  2. Several cl::opts which are specific to generate are moved under that sub-command. This is unfortunately, a breaking change. These options will no longer be accepted without an explicit sub-command being given, and the help text is moved under "llvm-stress help generate".

The newly added mode exposes the existing mutation logic used by our various libfuzzer based fuzzers. This allows the logic to be exercised, and used, when not linking against libfuzzer.

p.s. I know the command guide page needs rewritten. I figured I'd get initial feedback on the approach before doing so. That's definitely a blocker item for landing this.

Diff Detail

Event Timeline

reames created this revision.Dec 16 2022, 11:24 AM
Herald added a project: Restricted Project. · View Herald TranscriptDec 16 2022, 11:24 AM
reames requested review of this revision.Dec 16 2022, 11:24 AM
Herald added a project: Restricted Project. · View Herald TranscriptDec 16 2022, 11:24 AM

Is there much overlap between llvm-stress generate and llvm-stress mutate? Maybe that should just be a separate llvm-mutate tool? (Just throwing this out there, not advocating for anything in particular...)

Peter added a comment.EditedDec 16 2022, 11:43 AM

I have been working on FuzzMutate for half a year.
To FuzzMutate, generation and mutation is the same... generation is just initialize an empty module and them mutate.
So I don't think the function Generate() is necessary.
What you can do is initialize an empty Module("M", LLVMContext) and throw it to FuzzMutate.

Also, many features in Generate() have already been introduced into FuzzMutate.
IntroduceControlFlow is InsertCFGStrategy, works better if you add InsertPHIStrategy too.
FillFunction is InjectorStrategy
GenEmptyFunction is InsertFunctionCallStrategy, this strategy haven't been upstreamed yet. I am waiting for other diffs(https://reviews.llvm.org/D139894, :https://reviews.llvm.org/D139907) to be accepted before I send it for review.

I highly encourage you take look at IRMutator.h to see what strategies have been added.

Is there much overlap between llvm-stress generate and llvm-stress mutate? Maybe that should just be a separate llvm-mutate tool? (Just throwing this out there, not advocating for anything in particular...)

I'm open to either option here.

My thinking went as follows:

  1. They both related to generating test cases for fuzzing. (i.e the "stress" part of the name)
  2. The generate mode is simply a mutator starting with an empty function. At the moment, the code is separate, but it might make sense to merge parts. (Or not, no concrete plans here.) From a usage perspective, a user might want to use both llvm-stress generate, and llvm-stress mutate (on an empty input) at some experimentally established frequency. (As right now, the two modes generate different patterns at different frequencies.)
  3. As we adjust control knobs (i.e size, types, enable-disable flags), we're more likely to have a consistent interface if the tools in the same source.

I highly encourage you take look at IRMutator.h to see what strategies have been added.

I have. That's what led to the motivation to expose this via a CLI tool. (i.e. this patch)

Peter added a comment.EditedDec 16 2022, 11:56 AM

I think another interesting thing you can add is "How much do you want to mutate". mutateModule will only change the module once, i.e. insert one instruction, add one function(may be no one is using it), etc.

Can we add a command line arg --num-mutate= and default to, say 100, so we mutate the module 100 times before we return it.

Just a thought.

I think another interesting thing you can all is "How much do you want to mutate". mutateModule will only change the module once, i.e. insert one instruction, add one function(may be no one is using it).

Can we add a command line arg --num-mutate= and default to, say 100, so we mutate the module 100 times before we return it.

Just a thought.

Good idea. If you don't mind I'll leave that to a follow up commit. I also want to add some command line control of which strategies to select from. Again, in a separate commit.

I think another interesting thing you can all is "How much do you want to mutate". mutateModule will only change the module once, i.e. insert one instruction, add one function(may be no one is using it).

Can we add a command line arg --num-mutate= and default to, say 100, so we mutate the module 100 times before we return it.

Just a thought.

Good idea. If you don't mind I'll leave that to a follow up commit. I also want to add some command line control of which strategies to select from. Again, in a separate commit.

After writing this, I wondered how we'd been testing the new mutation strategies. The answer turned out to be, we weren't. This will give a means to write tests for individual mutations going forward as well.

Peter added a comment.Dec 16 2022, 1:05 PM

I think another interesting thing you can all is "How much do you want to mutate". mutateModule will only change the module once, i.e. insert one instruction, add one function(may be no one is using it).

Can we add a command line arg --num-mutate= and default to, say 100, so we mutate the module 100 times before we return it.

Just a thought.

Good idea. If you don't mind I'll leave that to a follow up commit. I also want to add some command line control of which strategies to select from. Again, in a separate commit.

After writing this, I wondered how we'd been testing the new mutation strategies. The answer turned out to be, we weren't. This will give a means to write tests for individual mutations going forward as well.

Sorry I don't understand what do you mean by we are not testing mutation strategies.

If you mean unit testing, we actually have hand written modules and try it on strategies in llvm/unittests/FuzzMutate/StrategiesTest.cpp.
If you mean more large scale end-to-end testing, I have been using these strategies the whole winter so they should be good.

I think another interesting thing you can all is "How much do you want to mutate". mutateModule will only change the module once, i.e. insert one instruction, add one function(may be no one is using it).

Can we add a command line arg --num-mutate= and default to, say 100, so we mutate the module 100 times before we return it.

Just a thought.

Good idea. If you don't mind I'll leave that to a follow up commit. I also want to add some command line control of which strategies to select from. Again, in a separate commit.

After writing this, I wondered how we'd been testing the new mutation strategies. The answer turned out to be, we weren't. This will give a means to write tests for individual mutations going forward as well.

Sorry I don't understand what do you mean by we are not testing mutation strategies.

If you mean unit testing, we actually have hand written modules and try it on strategies in llvm/unittests/FuzzMutate/StrategiesTest.cpp.
If you mean more large scale end-to-end testing, I have been using these strategies the whole winter so they should be good.

Oops, my apologies. I'd glanced at a recent commit and looked for changes in test/. I had not spotted the unit test. That's entirely my error.

Peter added a comment.Dec 16 2022, 1:29 PM

I think another interesting thing you can all is "How much do you want to mutate". mutateModule will only change the module once, i.e. insert one instruction, add one function(may be no one is using it).

Can we add a command line arg --num-mutate= and default to, say 100, so we mutate the module 100 times before we return it.

Just a thought.

Good idea. If you don't mind I'll leave that to a follow up commit. I also want to add some command line control of which strategies to select from. Again, in a separate commit.

After writing this, I wondered how we'd been testing the new mutation strategies. The answer turned out to be, we weren't. This will give a means to write tests for individual mutations going forward as well.

Sorry I don't understand what do you mean by we are not testing mutation strategies.

If you mean unit testing, we actually have hand written modules and try it on strategies in llvm/unittests/FuzzMutate/StrategiesTest.cpp.
If you mean more large scale end-to-end testing, I have been using these strategies the whole winter so they should be good.

Oops, my apologies. I'd glanced at a recent commit and looked for changes in test/. I had not spotted the unit test. That's entirely my error.

NP. I'd also start from test/. Its a myth to me why the framework author put them in unittest/ so I have to follow suit when writing new tests :)

nikic added a comment.Dec 19 2022, 7:10 AM

Is there much overlap between llvm-stress generate and llvm-stress mutate? Maybe that should just be a separate llvm-mutate tool? (Just throwing this out there, not advocating for anything in particular...)

I'm open to either option here.

My thinking went as follows:

  1. They both related to generating test cases for fuzzing. (i.e the "stress" part of the name)
  2. The generate mode is simply a mutator starting with an empty function. At the moment, the code is separate, but it might make sense to merge parts. (Or not, no concrete plans here.) From a usage perspective, a user might want to use both llvm-stress generate, and llvm-stress mutate (on an empty input) at some experimentally established frequency. (As right now, the two modes generate different patterns at different frequencies.)
  3. As we adjust control knobs (i.e size, types, enable-disable flags), we're more likely to have a consistent interface if the tools in the same source.

I think my main concern would be that llvm-stress generate is *not* just the same as llvm-stress mutate with an empty module -- it uses a different generation method with very different characteristics, so this might be a bit confusing.

Don't feel strong about this though, and the code looks fine to me. Might make sense to add a test with a fixed seed just as a basic sanity check?

llvm/tools/llvm-stress/llvm-stress.cpp
812

Any particular reason why this does not include all strategies?

bjope added a subscriber: uabelho.Dec 21 2022, 2:38 PM

I do not plan to give any review feedback at the moment. I've only touched this for legacy PM deprecation and I don't know much about the implementation details.

We do use llvm-stress downstream in some testing. But as long as we just use "generate" it looks like everything will be fine.

@uabelho: This will impact the options we've added downstream slightly, as we need to put them in the "subopts::Generate" category. But afaict that will be quite trivial.

bjope added inline comments.Dec 21 2022, 3:28 PM
llvm/tools/llvm-stress/llvm-stress.cpp
68

Maybe enough to say "Randomly mutates IR provided on standard in".

(to get rid of the repeated use of "provided")

I do not plan to give any review feedback at the moment. I've only touched this for legacy PM deprecation and I don't know much about the implementation details.

We do use llvm-stress downstream in some testing. But as long as we just use "generate" it looks like everything will be fine.

@uabelho: This will impact the options we've added downstream slightly, as we need to put them in the "subopts::Generate" category. But afaict that will be quite trivial.

Since I'm developing FuzzMutate most of my time, I am interested how you use llvm-stress downstream? If you have anything particular you want to generate I may write a strategy here upstream.