Merge the default one-prompt method into the new agentic workflow (Analysis+Enhancement) #785

DonggeLiu · 2025-02-10T05:18:47Z

Ensure GKE uses service account.
Remove JCC.
Add cycle in results.
Use cycle when saying logs.
Replace old Run/BuildResults with the new one.
Clean up repeated code (e.g., parse_libfuzzer_log)

local

DonggeLiu · 2025-02-10T05:19:14Z

/gcbrun exp -n dg

DonggeLiu · 2025-02-10T10:23:10Z

/gcbrun exp -n dg

DonggeLiu · 2025-02-10T11:06:41Z

/gcbrun exp -n dg

DonggeLiu · 2025-02-10T11:37:50Z

/gcbrun exp -n dg

DonggeLiu · 2025-02-11T04:52:10Z

/gcbrun exp -n dg

DonggeLiu · 2025-02-11T05:44:05Z

/gcbrun exp -n dg1

DonggeLiu · 2025-02-13T13:11:08Z

/gcbrun exp -n dg1

DonggeLiu · 2025-02-14T00:50:21Z

/gcbrun exp -n dg2

DonggeLiu · 2025-02-14T01:07:42Z

/gcbrun exp -n dg3

DonggeLiu · 2025-02-14T04:45:48Z

/gcbrun exp -n dg3

arthurscchan · 2025-02-14T05:13:10Z

@DonggeLiu LGTM for the merge.

DonggeLiu · 2025-02-14T10:23:01Z

/gcbrun exp -n dg4

DonggeLiu · 2025-02-14T12:10:17Z

/gcbrun exp -n dg5

DonggeLiu · 2025-02-14T20:36:06Z

The report looks good!
https://llm-exp.oss-fuzz.com/Result-reports/ofg-pr/2025-02-14-785-dg5-comparison/index.html

DavidKorczynski

This is great -- left a few nits but approved so you can land it when you're ready

DavidKorczynski · 2025-02-14T22:47:11Z

agent/one_prompt_enhancer.py

@@ -0,0 +1,73 @@
+"""An LLM agent to improve a fuzz target's runtime performance.


missing license?

Fixed, thanks!
I will add a CI check later.

Still missing :)

DavidKorczynski · 2025-02-14T22:47:33Z

agent/one_prompt_prototyper.py


-MAX_ROUND = 100
+MAX_ROUND = 5


is it intentional to set it to 5? Seems like a big reduction from 100?

Good point—it's worth trying a higher value (e.g., 10).

The original one-prompt workflow was capped at 5 rounds because each round built a fuzz target on a new Cloud Build, which was expensive. Now that we're using a cheaper approach, we can afford more rounds. However, since the one-prompt method only gets build error messages, if the LLM can't generate a valid target within a few rounds, additional rounds likely won't help, so 100 would be excessive.

For context, 100 rounds were used in the agentic method where rounds are lightweight and can obtain new information (e.g., bash commands).

DavidKorczynski · 2025-02-14T22:50:19Z

experiment/builder_runner.py

-        generated_project, benchmark_target_name,
-        self.work_dirs.run_logs_target(benchmark_target_name, iteration))
+    # run_log_path = self.work_dirs.run_logs_target(benchmark_target_name,
+    #                                               iteration)


do we want to keep these comments?

DavidKorczynski · 2025-02-14T22:50:39Z

experiment/builder_runner.py

@@ -538,7 +548,8 @@ def build_and_run_local(
        run_result.crashes, run_result.crash_info, \
          run_result.semantic_check = \
            self._parse_libfuzzer_logs(f, project_name, flag)
-      run_result.succeeded = not run_result.semantic_check.has_err
+      # run_result.succeeded = not run_result.semantic_check.has_err


intentional to leave this commented out?

DavidKorczynski · 2025-02-14T22:51:07Z

experiment/builder_runner.py

-          generated_project)
+    #   # Overwrite the Dockerfile to be caching friendly
+    #   oss_fuzz_checkout.rewrite_project_to_cached_project_chronos(
+    #       generated_project)


could you leave a comment on why this is being commented out?

DavidKorczynski · 2025-02-14T22:51:33Z

experiment/evaluator.py

+    #              run_result.semantic_check.type,
+    #              run_result.triage,
+    #              compile_error=build_result.log_path,
+    #              compile_log=build_result.log_path))


could you leave a comment here?

DavidKorczynski · 2025-02-14T22:54:52Z

logger.py

+        local_file.write(tmp_file.read())
+
+      os.remove(tmp_path)
+      # blob.download_to_filename(local_path)


could you leave a comment/remove/uncomment?

DavidKorczynski · 2025-02-14T22:56:40Z

logger.py

@@ -4,9 +4,13 @@
 import json


missing a license here as well?

DavidKorczynski · 2025-02-14T22:56:51Z

pipeline.py

@@ -4,7 +4,7 @@

 import logger


missing license?

DavidKorczynski · 2025-02-14T22:57:02Z

results.py

@@ -4,6 +4,7 @@



license nit

mihaimaruseac · 2025-02-15T14:40:06Z

agent/one_prompt_enhancer.py

@@ -0,0 +1,73 @@
+"""An LLM agent to improve a fuzz target's runtime performance.


Still missing :)

DonggeLiu added 15 commits February 10, 2025 14:49

An Enhancer to fix fuzz target based on analysis

06b6ede

Temporarily pass trial as a parameter to name logs

66b8584

No semantic analysis when building and running fuzz target

704111b

generate success does not require run success

9010f89

Minor lint fix

af969d1

Make semantic result Serializable

70ffd9e

Temporarily disable cache because of an error in coverage image

3b68f28

A way to download run log from GCS

af183c6

Add analysis and enhance steps in pipline

b081d9b

A new result type: AnalysisResult

185dc38

Add new steps and agents

4984225

A scaffold analysis stage

0022666

Allow getting agent by ID

996e372

A build_result for temp debugging, add trial for writing run log to

d5f42cb

local

Get enhancer by ID

7bd62d8

DonggeLiu marked this pull request as draft February 10, 2025 10:23

A semantic analyzer

0c0f344

DonggeLiu added 3 commits February 11, 2025 15:48

Allow agents to send a single query to LLM

4a28d4e

Minor code restructure to ease usages

26a03fa

Simplify one_prompt_prototyper to build fuzz target

78e6fbf

DonggeLiu added 2 commits February 11, 2025 16:37

Append analysis result into chat history log

5aacacb

Add a place holder for enhancer chat history

edbcc3c

DonggeLiu added 2 commits February 11, 2025 21:42

Remove unused function

e930a5f

Correct info in AnalysisResult

72b245b

Make func_info JSON serializable to save into result.json

27ba11b

DonggeLiu added 4 commits February 14, 2025 10:45

Make OPP logs more flexible for its children

f4edc9b

Re-try on service unavailable

68d8f25

Verify if GOOGLE APPLICATION CREDENTIALS is set

cde368f

Temporary debug logs

816daee

Append runlogs of different cycles, not overwriting

424a96b

DonggeLiu added 4 commits February 14, 2025 13:07

Append to run/build logs, not overwrite

b8ede94

Service account debug log +1

f5c7d55

More structured result aggregation

477d7d2

TODOs to simplify Result type

6e14e89

DonggeLiu added 2 commits February 14, 2025 21:11

Measure coverage in execution stage for now

2244684

Fix log

306357a

Overwrite logs instead of append

f8608a3

DonggeLiu requested review from oliverchang, mihaimaruseac and DavidKorczynski February 14, 2025 20:35

DonggeLiu marked this pull request as ready for review February 14, 2025 20:36

DavidKorczynski approved these changes Feb 14, 2025

View reviewed changes

DavidKorczynski reviewed Feb 14, 2025

View reviewed changes

logger.py

@@ -4,9 +4,13 @@

import json

Copy link

Collaborator

DavidKorczynski Feb 14, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

missing a license here as well?

DavidKorczynski reviewed Feb 14, 2025

View reviewed changes

pipeline.py

@@ -4,7 +4,7 @@

import logger

Copy link

Collaborator

DavidKorczynski Feb 14, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

missing license?

DavidKorczynski reviewed Feb 14, 2025

View reviewed changes

results.py

@@ -4,6 +4,7 @@

Copy link

Collaborator

DavidKorczynski Feb 14, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

license nit

mihaimaruseac approved these changes Feb 15, 2025

View reviewed changes

agent/one_prompt_enhancer.py

@@ -0,0 +1,73 @@

"""An LLM agent to improve a fuzz target's runtime performance.

Copy link

Member

mihaimaruseac Feb 15, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Still missing :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge the default one-prompt method into the new agentic workflow (Analysis+Enhancement) #785

Merge the default one-prompt method into the new agentic workflow (Analysis+Enhancement) #785

DonggeLiu commented Feb 10, 2025 •

edited

Loading

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 11, 2025

DonggeLiu commented Feb 11, 2025

DonggeLiu commented Feb 13, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

arthurscchan commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DavidKorczynski left a comment

DavidKorczynski Feb 14, 2025

DonggeLiu Feb 15, 2025

mihaimaruseac Feb 15, 2025

DavidKorczynski Feb 14, 2025

DonggeLiu Feb 15, 2025

DavidKorczynski Feb 14, 2025

DavidKorczynski Feb 14, 2025

DavidKorczynski Feb 14, 2025

DavidKorczynski Feb 14, 2025

DavidKorczynski Feb 14, 2025

DavidKorczynski Feb 14, 2025

DavidKorczynski Feb 14, 2025

DavidKorczynski Feb 14, 2025

mihaimaruseac Feb 15, 2025

		@@ -0,0 +1,73 @@
		"""An LLM agent to improve a fuzz target's runtime performance.


		MAX_ROUND = 100
		MAX_ROUND = 5

Merge the default one-prompt method into the new agentic workflow (Analysis+Enhancement) #785

Are you sure you want to change the base?

Merge the default one-prompt method into the new agentic workflow (Analysis+Enhancement) #785

Conversation

DonggeLiu commented Feb 10, 2025 • edited Loading

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 10, 2025

DonggeLiu commented Feb 11, 2025

DonggeLiu commented Feb 11, 2025

DonggeLiu commented Feb 13, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

arthurscchan commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DonggeLiu commented Feb 14, 2025

DavidKorczynski left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DonggeLiu commented Feb 10, 2025 •

edited

Loading