Add a basic fuzz testing harness for Dilithium2 #1905

nathaniel-brough · 2024-08-27T02:32:59Z

The purpose of this PR is to improve the overall security of liboqs. This is by adding a very basic fuzz as described here #1215.

This PR just adds a new style of test i.e. a fuzz test and therefore doesn't need additional unit tests. Integrating this into the CI isn't currently possible but I've got a bit of a plan as to how to go ahead with it which I've described here.

Does this PR change the input/output behaviour of a cryptographic algorithm (i.e., does it change known answer test values)? (If so, a version bump will be required from x.y.z to x.(y+1).0.)
Does this PR change the list of algorithms available -- either adding, removing, or renaming? Does this PR otherwise change an API? (If so, PRs in fully supported downstream projects dependent on these, i.e., oqs-provider will also need to be ready for review and merge by the time this is merged.)

nathaniel-brough · 2024-08-27T07:20:26Z

I'm pretty happy with with where the fuzz harness is at. It's not anything particularly complicated, just essentially throws a bunch of messages at the dilithium api. I've got a little bit more work to do in integrating oss-fuzz before I want to merge this.

I plan on following up with more fuzzer's and more in depth testing e.g. constant time/algorithm fuzzing etc. But I'll just start with getting all the automation up and running with something super simple like this.

baentsch · 2024-08-27T08:11:56Z

I've got a little bit more work to do in integrating oss-fuzz before I want to merge this.

Do you also plan on adding more algorithms to the fray?

I'll just start with getting all the automation up and running

Sounds like a good plan.

Also some documentation (how to start and read results, say) would be very welcome.

nathaniel-brough · 2024-08-27T08:18:39Z

Do you also plan on adding more algorithms to the fray?

Yeah for sure. I plan on slowly chipping away at it, one algorithm at a time. I admit that I'm not familiar with most of these algorithms so it may take me some time. But I'm very much on top of how to write efficient fuzz-harnesses. So hopefully we'll end up with something that's useful at the end of it all.

Also some documentation (how to start and read results, say) would be very welcome.

Sure does the markdown docs under //docs get automatically pushed to the wiki or is there another repo somewhere for that?

baentsch · 2024-08-27T11:57:26Z

Yeah for sure. I plan on slowly chipping away at it, one algorithm at a time. I admit that I'm not familiar with most of these algorithms so it may take me some time.

You shouldn't have to be -- they're all behind the same API. In fact, lots of code in liboqs is generated -- and I'd argue this should also hold true for the fuzz harness. You may want to take a look at the copy_from_upstream logic....

Sure does the markdown docs under //docs get automatically pushed to the wiki or is there another repo somewhere for that?

Nope. Our semi-formal agreement is to include documentation that is dependent on scripts in the repo there (in the repo, under "docs") and only leave code-independent stuff (procedures for release etc) in the Wiki.

CMakeLists.txt

baentsch

Conceptually LGTM. As it only supports a single algorithm (OK for feasibility test, not OK for a release), I'd hold off approving/merging this until 0.11.0 is out. Other opinions welcome.

baentsch · 2024-08-27T12:16:35Z

@silvergasp please check out https://github.com/open-quantum-safe/liboqs/blob/main/CONTRIBUTING.md#coding-style

dstebila · 2024-08-27T14:09:48Z

Thanks for the great work, Nathaniel! This is a great start.

I don't have any experience with fuzzing, but your plan in #1215 (comment) sounds promising.

Obviously we would eventually want fuzzing against as many algorithms as possible, and as Michael points out we have a code generation system that can help with that.

Conceptually LGTM. As it only supports a single algorithm (OK for feasibility test, not OK for a release), I'd hold off approving/merging this until 0.11.0 is out. Other opinions welcome.

@baentsch I don't think it harms the 0.11.0 release to have this landed even if it's only for one algorithm, though we'd want to be careful to make sure we word things in a way that users don't think all algorithms have been fuzzed. E.g. change the title of the PR/commit to "Add a basic fuzz testing harness for Dilithium2". I wouldn't hold 0.11.0 for this, but we're not in a code-freeze yet either so I don't want to block progress on this.

baentsch · 2024-08-27T14:20:17Z

change the title of the PR/commit to "Add a basic fuzz testing harness for Dilithium2".

With that (and CI passing) I'd agree to merge, too.

nathaniel-brough · 2024-08-29T01:59:10Z

Also some documentation (how to start and read results, say) would be very welcome.

Done

@silvergasp please check out https://github.com/open-quantum-safe/liboqs/blob/main/CONTRIBUTING.md#coding-style

Done, at least I think I've got it right this time.

Obviously we would eventually want fuzzing against as many algorithms as possible, and as Michael points out we have a code generation system that can help with that.

I've only given it a cursory read for now, and it's not entirely obvious to me how I would integrate this with fuzzing harnesses. But I'll have time to take a deeper dive with code generation next week.

E.g. change the title of the PR/commit to "Add a basic fuzz testing harness for Dilithium2". I wouldn't hold 0.11.0 for this, but we're not in a code-freeze yet either so I don't want to block progress on this.

Done. I've also added a 'current status section' to the fuzzing docs.

SWilson4

Thanks very much for this proposed contribution, @silvergasp! It would help to resolve one of our oldest outstanding issues.

I've commented on a few minor style points. I'd also like to see if we can have this running in CI. Probably the easiest way to do this is to add a python wrapper in the tests directory (similar to, say, test_leaks.py) and trigger it with a "fuzzing" CI job, which can be added to the big matrix in .github/workflows/linux.yml. Let me know if you need help with that. We just landed a big CI refactor this morning, so you will want to sync your fork before taking this on.

CONFIGURE.md

docs/FUZZING.md

CONFIGURE.md

SWilson4 · 2024-09-09T16:00:30Z

I'd also like to see if we can have this running in CI. Probably the easiest way to do this is to add a python wrapper in the tests directory (similar to, say, test_leaks.py) and trigger it with a "fuzzing" CI job, which can be added to the big matrix in .github/workflows/linux.yml. Let me know if you need help with that. We just landed a big CI refactor this morning, so you will want to sync your fork before taking this on.

Whoops, I should have read through the related issue (#1215) more carefully before suggesting this. If you have a different plan for getting this running in CI, please feel free to disregard this suggestion.

nathaniel-brough · 2024-09-10T05:46:37Z

Whoops, I should have read through the related issue (#1215) more carefully before suggesting this. If you have a different plan for getting this running in CI, please feel free to disregard this suggestion.

Yeah I've got a plan in mind :)

I've just accepted all your requested edit's but squashed them back into the original commit, adding you as co-author.

nathaniel-brough · 2024-09-16T23:51:48Z

Is there anything more that I need to address here?

docs/FUZZING.md

baentsch · 2024-09-17T05:40:45Z

Is there anything more that I need to address here?

Sorry for dropping the ball on my side, @silvergasp . I just triggered CI running and it failed with an error that seems triggered by the PR: Please check it out: Goal should be to get CI to green. Then, what about adding a CI test enabling the new option? Can there be a "short" test run to ascertain the fuzzing build is OK without spending hours in CI?

dstebila · 2024-09-17T16:50:02Z

I suggest we hold off on merging this until after the 0.11.0 release.

nathaniel-brough · 2024-09-23T01:01:55Z

Sorry for dropping the ball on my side, @silvergasp . I just triggered CI running and it failed with an error that seems triggered by the PR: Please check it out: Goal should be to get CI to green. Then, what about adding a CI test enabling the new option? Can there be a "short" test run to ascertain the fuzzing build is OK without spending hours in CI?

Not a problem. I've fixed the CI and tested it using a personal branch so that I could confirm that it indeed works as I don't have permissions to run the CI here.

As requested I've also added a CI build stage that builds the fuzz harness and runs it for 30s. I will follow up later with more sophisticated CI integration once the oss-fuzz stuff has gone ahead.

nathaniel-brough · 2024-09-23T01:05:58Z

I suggest we hold off on merging this until after the 0.11.0 release.

Easy, there is no immediate rush on my end. I'll be taking on new responsibilities in about a months time, so after that I may be a little slower to respond.

SWilson4

This looks good to me, but I will hold off on approving until we have 0.11.0 out the door.

tests/fuzz_test_dilithium2.c

SWilson4 · 2024-09-23T16:02:05Z

docs/FUZZING.md

+    - [ ] dilithium1
+    - [x] dilithium2


Suggested change

- [ ] dilithium1

- [x] dilithium2

- [x] dilithium2

- [ ] dilithium3

- [ ] dilithium5

Is there a dilithium4?

I've incorporated this change into the original commit leaving out dilithium4, as after some searching it doesn't look like it exists.

I just noticed that dilithium1 remains here—as far as I know it doesn't exist (and if it does, we don't have it in the library.

dstebila · 2024-10-01T00:00:12Z

I'm new to fuzzing, so one question I was wondering about is how one decides which inputs to fuzz. For example, I notice here that the fuzzing is done only on the message input of the signing and verifying algorithms. Would one ever want to fuzz the other inputs -- public key, secret key, signature? Why or why not? It's okay if the answer is yes, but you're just setting up a basic harness here, I just want to understand what scope of fuzzing is desirable.

nathaniel-brough · 2024-10-01T00:57:04Z

I'm new to fuzzing, so one question I was wondering about is how one decides which inputs to fuzz. For example, I notice here that the fuzzing is done only on the message input of the signing and verifying algorithms.

Ideally you'd fuzz as many inputs as possible.

Would one ever want to fuzz the other inputs -- public key, secret key, signature? Why or why not? It's okay if the answer is yes, but you're just setting up a basic harness here, I just want to understand what scope of fuzzing is desirable.

Yes, although this is a little more complicated so I haven't done it yet. The way that fuzzing works is it will generate a set of randomised bytes which is then passed into your fuzz harness a number of times. The fuzzer will then measure the code-coverage for each run, the inputs that produce the highest coverage are then randomly mutated to create new inputs. This is called coverage driven fuzzing. It looks something like this;

flowchart LR
    A[Random Input] --> B[Fuzz Harness]
    B --> C[Collect code coverage information]
    C -->  D[Mutate orgional input to maximize coverage] 
    D --> B

The reason that I've opted to NOT fuzz the public/private keys yet is because it would take a really long time for the fuzzer to "guess" at a private/public key pair that actually correspond to each other, so it's very likely that the fuzzer would just be fuzzing the error handling code. Which is still useful and I may follow up with a separate harness that targets that, but in my opinion it is less useful than fuzzing the main code-path.

So in short you want to balance the ability for the fuzzer to "guess" at the "correct" inputs that unlock large sections of code-coverage against actually fuzzing all the inputs. There aren't any hard rules as to what should/shouldn't be fuzzed, it's mostly a mix of intuition and experimentation e.g. when you run a fuzzer you can see in real-time the number of "code-blocks" (the actual code between jump instructions e.g. if/else/function_calls) that the fuzzer has been able to run under the cov: column. So you can make a small change in your harness and run the fuzzer for a bit and compare the cov: field before/after. It's not really an exact science but it's significantly better than just guessing.

INFO: Seed: 1523017872
INFO: Loaded 1 modules (16 guards): [0x744e60, 0x744ea0),
INFO: -max_len is not provided, using 64
INFO: A corpus is not provided, starting from an empty corpus
#0    READ units: 1
#1    INITED cov: 3 ft: 2 corp: 1/1b exec/s: 0 rss: 24Mb
#3811 NEW    cov: 4 ft: 3 corp: 2/2b exec/s: 0 rss: 25Mb L: 1 MS: 5 ChangeBit-ChangeByte-ChangeBit-ShuffleBytes-ChangeByte-
#3827 NEW    cov: 5 ft: 4 corp: 3/4b exec/s: 0 rss: 25Mb L: 2 MS: 1 CopyPart-
#3963 NEW    cov: 6 ft: 5 corp: 4/6b exec/s: 0 rss: 25Mb L: 2 MS: 2 ShuffleBytes-ChangeBit-
#4167 NEW    cov: 7 ft: 6 corp: 5/9b exec/s: 0 rss: 25Mb L: 3 MS: 1 InsertByte-

Let me know if you have any further/follow-up questions.

dstebila · 2024-10-01T14:49:47Z

Thanks for the very helpful explanation, @silvergasp!

nathaniel-brough · 2024-10-15T00:29:32Z

Friendly ping. Is there anything blocking this PR that you'd like me to address?

.github/workflows/basic.yml

SWilson4 · 2024-10-15T14:23:20Z

docs/FUZZING.md

+    - [ ] dilithium1
+    - [x] dilithium2


I just noticed that dilithium1 remains here—as far as I know it doesn't exist (and if it does, we don't have it in the library.

SWilson4 · 2024-10-15T14:26:32Z

Friendly ping. Is there anything blocking this PR that you'd like me to address?

With two approvals (and 0.11.0 out the door) you are good to go. That said, I noticed a couple of trivial things on re-reading the diff (see comments above).

Co-authored-by: Spencer Wilson <[email protected]> Signed-off-by: Nathaniel Brough <[email protected]>

nathaniel-brough · 2024-10-16T05:50:12Z

@SWilson4 I've incorporated those small changes you mentioned into the two original commits and rebased it all onto main/HEAD.

SWilson4 · 2024-10-18T14:25:58Z

Just checking with @dstebila and @ryjones: does #1215 (comment) or any other outstanding issue need to be sorted out before this lands?

dstebila · 2024-10-18T16:48:02Z

Just checking with @dstebila and @ryjones: does #1215 (comment) or any other outstanding issue need to be sorted out before this lands?

Since the alias [email protected] is already in place, I think this can proceed independent of the domain transfer.

SWilson4 · 2024-10-18T17:16:37Z

Thanks for the contribution, @nathaniel-brough!

nathaniel-brough mentioned this pull request Aug 27, 2024

Add fuzzing testing #1215

Open

nathaniel-brough force-pushed the main branch 2 times, most recently from 0571f45 to e27055c Compare August 27, 2024 07:16

nathaniel-brough mentioned this pull request Aug 27, 2024

liboqs: Initial integration google/oss-fuzz#12408

Merged

nathaniel-brough marked this pull request as ready for review August 27, 2024 08:19

nathaniel-brough requested a review from dstebila as a code owner August 27, 2024 08:19

baentsch reviewed Aug 27, 2024

View reviewed changes

CMakeLists.txt Show resolved Hide resolved

baentsch reviewed Aug 27, 2024

View reviewed changes

nathaniel-brough changed the title ~~Add a basic fuzz testing harness~~ Add a basic fuzz testing harness for Dilithium2 Aug 29, 2024

nathaniel-brough force-pushed the main branch from e27055c to 48d71b8 Compare August 29, 2024 01:52

SWilson4 reviewed Sep 9, 2024

View reviewed changes

nathaniel-brough force-pushed the main branch from 3950250 to 5e3baf6 Compare September 10, 2024 05:42

baentsch reviewed Sep 17, 2024

View reviewed changes

docs/FUZZING.md Outdated Show resolved Hide resolved

nathaniel-brough force-pushed the main branch from 5e3baf6 to b84b961 Compare September 23, 2024 00:54

SWilson4 reviewed Sep 23, 2024

View reviewed changes

nathaniel-brough force-pushed the main branch from b84b961 to 2c5bf6d Compare September 24, 2024 02:08

dstebila approved these changes Oct 1, 2024

View reviewed changes

SWilson4 approved these changes Oct 2, 2024

View reviewed changes

SWilson4 reviewed Oct 15, 2024

View reviewed changes

nathaniel-brough force-pushed the main branch from 8883e7d to b89ce67 Compare October 16, 2024 05:48

nathaniel-brough and others added 2 commits October 16, 2024 15:48

Add a basic fuzz testing harness for dilithium2

516a3de

Co-authored-by: Spencer Wilson <[email protected]> Signed-off-by: Nathaniel Brough <[email protected]>

Add basic build checks for fuzz tests

1a5b1f1

Co-authored-by: Spencer Wilson <[email protected]> Signed-off-by: Nathaniel Brough <[email protected]>

nathaniel-brough force-pushed the main branch from b89ce67 to 1a5b1f1 Compare October 16, 2024 05:48

SWilson4 merged commit 0310631 into open-quantum-safe:main Oct 18, 2024
72 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a basic fuzz testing harness for Dilithium2 #1905

Add a basic fuzz testing harness for Dilithium2 #1905

nathaniel-brough commented Aug 27, 2024

nathaniel-brough commented Aug 27, 2024

baentsch commented Aug 27, 2024

nathaniel-brough commented Aug 27, 2024

baentsch commented Aug 27, 2024

baentsch left a comment

baentsch commented Aug 27, 2024

dstebila commented Aug 27, 2024

baentsch commented Aug 27, 2024

nathaniel-brough commented Aug 29, 2024

SWilson4 left a comment

SWilson4 commented Sep 9, 2024

nathaniel-brough commented Sep 10, 2024

nathaniel-brough commented Sep 16, 2024

baentsch commented Sep 17, 2024

dstebila commented Sep 17, 2024

nathaniel-brough commented Sep 23, 2024

nathaniel-brough commented Sep 23, 2024

SWilson4 left a comment

SWilson4 Sep 23, 2024

nathaniel-brough Sep 24, 2024

nathaniel-brough Sep 24, 2024

SWilson4 Oct 15, 2024

dstebila commented Oct 1, 2024

nathaniel-brough commented Oct 1, 2024

dstebila commented Oct 1, 2024

nathaniel-brough commented Oct 15, 2024

SWilson4 Oct 15, 2024

SWilson4 commented Oct 15, 2024

nathaniel-brough commented Oct 16, 2024

SWilson4 commented Oct 18, 2024

dstebila commented Oct 18, 2024

SWilson4 commented Oct 18, 2024

Add a basic fuzz testing harness for Dilithium2 #1905

Add a basic fuzz testing harness for Dilithium2 #1905

Conversation

nathaniel-brough commented Aug 27, 2024

nathaniel-brough commented Aug 27, 2024

baentsch commented Aug 27, 2024

nathaniel-brough commented Aug 27, 2024

baentsch commented Aug 27, 2024

baentsch left a comment

Choose a reason for hiding this comment

baentsch commented Aug 27, 2024

dstebila commented Aug 27, 2024

baentsch commented Aug 27, 2024

nathaniel-brough commented Aug 29, 2024

SWilson4 left a comment

Choose a reason for hiding this comment

SWilson4 commented Sep 9, 2024

nathaniel-brough commented Sep 10, 2024

nathaniel-brough commented Sep 16, 2024

baentsch commented Sep 17, 2024

dstebila commented Sep 17, 2024

nathaniel-brough commented Sep 23, 2024

nathaniel-brough commented Sep 23, 2024

SWilson4 left a comment

Choose a reason for hiding this comment

SWilson4 Sep 23, 2024

Choose a reason for hiding this comment

nathaniel-brough Sep 24, 2024

Choose a reason for hiding this comment

nathaniel-brough Sep 24, 2024

Choose a reason for hiding this comment

SWilson4 Oct 15, 2024

Choose a reason for hiding this comment

dstebila commented Oct 1, 2024

nathaniel-brough commented Oct 1, 2024

dstebila commented Oct 1, 2024

nathaniel-brough commented Oct 15, 2024

SWilson4 Oct 15, 2024

Choose a reason for hiding this comment

SWilson4 commented Oct 15, 2024

nathaniel-brough commented Oct 16, 2024

SWilson4 commented Oct 18, 2024

dstebila commented Oct 18, 2024

SWilson4 commented Oct 18, 2024