This is an archive of the discontinued LLVM Phabricator instance.

[clang-tidy] Create properly seeded random generator check
ClosedPublic

Authored by boga95 on Mar 6 2018, 3:58 AM.

Details

Summary

This check flags all pseudo-random number engines and engine adaptors instantiations when it initialized or seeded with default argument or a constant expression. Pseudo-random number engines seeded with a predictable value may cause vulnerabilities e.g. in security protocols.
This is a CERT security rule, see MSC51-CPP.

Example:

void foo() {
    std::mt19937 engine1; // Bad, always generate the same sequence
    std::mt19937 engine2(1); // Bad
    engine1.seed(); // Bad
    engine2.seed(1); // Bad
    
    std::time_t t;
    engine1.seed(std::time(&t)); // Bad, system time might be controlled by user

    std::random_device dev;
    std::mt19937 engine3(dev()); // Good
}

Diff Detail

Event Timeline

boga95 created this revision.Mar 6 2018, 3:58 AM

Thank you for working on this check!

Do you think it might be possible to also add a check for cert-msc32-c that handles the C side of things, and use a common check to implement both? C won't ever have the C++'isms, but C++ can certainly abuse std::srand() so it seems like there's at least some overlap.

clang-tidy/cert/CERTTidyModule.cpp
43

The name should be cert-msc51-cpp (and categorized with the other msc checks).

clang-tidy/cert/ProperlySeededRandomGeneratorCheck.cpp
23–25

These should be using fully-qualified names, like ::std::mersenne_twister_engine. Also, I think this list should be configurable so that users can add their own engines if needed.

67

Formatting looks off here.

72

The diagnostics here aren't quite correct -- a random_device isn't *required*. For instance, it's also fine to open up /dev/random and read a seed's worth of bytes. I think a better phrasing might be: random number generator seeded with a default argument will generate a predictable sequence of values.

78–79

I'd probably use similar wording here: random number generator seeded with a constant value will generate a predictable sequence of values.

You can combine these diagnostics and use %select{}0 to choose between the variants.

clang-tidy/cert/ProperlySeededRandomGeneratorCheck.h
34–35

It seems like this should be a private function rather than public.

docs/clang-tidy/checks/cert-properly-seeded-random-generator.rst
3 ↗(On Diff #137151)

When renaming the check, be sure to update titles, file names, etc.

7 ↗(On Diff #137151)

Remove "it".

22–23 ↗(On Diff #137151)

There's no test case for this test, and I don't think this code would generate a diagnostic in the current check form. This might require a (user-configurable) list of disallowed sources of seed values. For instance, it should diagnose when called with a time_t value or a direct call to a time function, but it need not diagnose if the value is somehow obscured. e.g.,

unsigned get_seed() { std::time_t t; return std::time(&t); }
engine1.seed(get_seed()); // No diagnostic required
aaron.ballman edited reviewers, added: aaron.ballman, alexfh; removed: Restricted Project.Mar 6 2018, 5:52 AM
aaron.ballman removed a subscriber: aaron.ballman.
Quuxplusone added inline comments.
docs/clang-tidy/checks/cert-properly-seeded-random-generator.rst
26 ↗(On Diff #137151)

Seeding MT19937 with a single 32-bit integer is not "Good". It makes the seed super easy to brute-force; and for example, engine3 will never produce 7 or 13 as its first output.
http://www.pcg-random.org/posts/cpp-seeding-surprises.html

This doesn't affect the implementation or usefulness of this clang-tidy check, which is pretty nifty. I merely object to marking this sample code with the comment "Good" in official documentation. It should be marked "Will not warn" at best. Or replace it with something slightly more realistic, e.g.

int x = atoi(argv[1]);
std::mt19937 engine3(x);  // Will not warn

As Aaron said above, seeding with the current time is approximately as good an idea, and "will not warn" with the current diagnostic either.

The correct way to seed a PRNG is to initialize the entire state with random bits, not just 32 bits of the state. This can be done, but not yet in standard C++: http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2016/p0205r0.html

test/clang-tidy/cert-properly-seeded-random-generator.cpp
76 ↗(On Diff #137151)

Is the diagnostic suppressed if seed is a template parameter? (Not that I'd do this. It's just a corner case I thought of.)

boga95 updated this revision to Diff 137915.Mar 10 2018, 10:24 AM
boga95 marked an inline comment as done.

Add capability to provide a user-configurable list of disallowed types which will trigger warning. Now it also detects C srand function. Rename check to cert-msc51-cpp. Add cert-msc32-c check. Change warning messages. Other minor fixes.

lebedev.ri retitled this revision from Create properly seeded random generator check to [clang-tidy] Create properly seeded random generator check.Mar 10 2018, 10:27 AM
boga95 marked 4 inline comments as done.Mar 12 2018, 12:42 PM
boga95 added a subscriber: szepet.
boga95 added inline comments.
test/clang-tidy/cert-properly-seeded-random-generator.cpp
76 ↗(On Diff #137151)

Hopefully, it is not suppressed.

aaron.ballman added inline comments.Mar 12 2018, 5:20 PM
clang-tidy/cert/CERTTidyModule.cpp
40

This change looks unrelated to the patch.

44

This change looks unrelated to the patch.

clang-tidy/cert/ProperlySeededRandomGeneratorCheck.cpp
36–38

These should be full-qualified names.

75

I think that in C mode, this is fine, but in C++ mode it should register ::std::srand.

115

You can use llvm::find() instead.

docs/clang-tidy/checks/cert-msc51-cpp.rst
8

Backticks around srand

12

Should link to the CERT rule, and also link to the C CERT rule.

Eugene.Zelenko added inline comments.
docs/ReleaseNotes.rst
63

Please replace srand with `srand ()`.

docs/clang-tidy/checks/cert-msc51-cpp.rst
8

Two of them and please add (). Will be good idea to make first statement same as in Release Notes.

30

Is there guideline documentation available online? If so, please add link. See other checks documentation as example.

alexfh removed a reviewer: alexfh.Mar 14 2018, 6:08 AM
boga95 updated this revision to Diff 144336.Apr 27 2018, 6:53 AM

This is getting closer, but it seems there are still unresolved comments from reviewers. Are you planning to resolve those as well?

dkrupp added a subscriber: dkrupp.May 25 2018, 4:12 AM
boga95 marked 12 inline comments as done.Jun 24 2018, 11:47 AM

I think I resolved all of the comments. Do I forget anything?

clang-tidy/cert/CERTTidyModule.cpp
44

Clang format did it when I apply it to the whole file.

clang-tidy/cert/ProperlySeededRandomGeneratorCheck.cpp
75

It is not match for ::std::srand, just for ::srand. I found some examples, but I think they don't work neither.

docs/clang-tidy/checks/cert-msc51-cpp.rst
30

There is a guideline here.

aaron.ballman added inline comments.Jun 25 2018, 5:17 AM
clang-tidy/cert/CERTTidyModule.cpp
44

You should clang-format the patch, not the entire file. See https://clang.llvm.org/docs/ClangFormat.html#script-for-patch-reformatting for details.

clang-tidy/cert/ProperlySeededRandomGeneratorCheck.cpp
75

As it stands, I'm not certain whether this will match code like std::srand(0);. Please add a test case to the C++ tests for it as it should be handled.

docs/clang-tidy/checks/cert-msc51-cpp.rst
8

This comment still has not been handled. Also, please remove the space between srand and the parens.

boga95 updated this revision to Diff 152778.Jun 25 2018, 2:48 PM
boga95 marked 3 inline comments as done.

Add std::srand check to C++ tests. Format patch.

aaron.ballman added inline comments.Jun 27 2018, 11:08 AM
clang-tidy/cert/ProperlySeededRandomGeneratorCheck.cpp
25

In order to conform to CERT's rules, I think this needs a default string of at least time_t and std::time_t.

docs/ReleaseNotes.rst
60

You should also add a note for cert-msc32-c.

docs/clang-tidy/checks/cert-msc51-cpp.rst
8

Please add double backticks around srand() instead of single backticks.

29

Instead of using "Bad" in the comments, it would be good to use "diagnose" or "error" instead.

Note that for this one, in particular, the default behavior is to *not* diagnose, which means this check doesn't really conform to CERT's rules.

boga95 updated this revision to Diff 153169.Jun 27 2018, 1:45 PM
boga95 marked 3 inline comments as done.
boga95 updated this revision to Diff 153172.Jun 27 2018, 1:53 PM
boga95 marked 3 inline comments as done.
aaron.ballman accepted this revision.Jun 28 2018, 10:10 AM

LGTM with a small documentation nit.

docs/clang-tidy/checks/cert-msc51-cpp.rst
40

You should list the default values here explicitly.

This revision is now accepted and ready to land.Jun 28 2018, 10:10 AM
boga95 updated this revision to Diff 153655.Jul 1 2018, 1:52 PM

This still LGTM; do you need someone to commit on your behalf?

How can I commit it?

docs/clang-tidy/checks/cert-msc51-cpp.rst
8

Should I use double backticks in release notes too?

aaron.ballman closed this revision.Jul 4 2018, 6:22 PM

How can I commit it?

You need to have obtained commit access first. I went ahead and committed on your behalf in r336301. Thank you for the patch!

docs/clang-tidy/checks/cert-msc51-cpp.rst
8

Good catch; I'll add those when I commit.