Conversation

cristobalvch
Contributor

A benchmark designed to evaluate fully automated integration of LLMs (Large Language Models), with no HITL (Human-in-the-Loop), into web application attack scenarios using CAI (Cybersecurity AI). Its goal is to test various prompting strategies and different LLMs to assess their effectiveness at identifying vulnerabilities in web applications.

## Project Folder Structure
In this section, the main folder structure is described.
```plaintext
llm-cai-project/ # Root directory of the project
```
Member
@cristobalvch i think this route is wrong


**Fully Automated (No HITL):**
The pipeline is designed to be **fully automated, with no Human-in-the-Loop (HITL)**. When the agent attempts to solve the challenge labs, **no human interaction with the model is required**; all decisions, iterations, and actions are executed autonomously according to the experiment’s configuration and the prompt templates.

Member

@cristobalvch could you please add a "results" section in here wherein you include images and summarize results obtained, that'd be very interesting.

@vmayoral self-assigned this on Aug 22, 2025
@Mery-Sanz
Collaborator

I really liked the implementation! I tested it and it works correctly.
The only thing I’d need is for you to add a /logs folder with a .gitkeep, otherwise this error is raised when running:

```plaintext
FileNotFoundError: [Errno 2] No such file or directory: 'logs'
```

Also, if you could add a table with all the PortSwigger challenges, that would be awesome 🙌
It should be trivial to generate from benchmarks/prompt-bench/utils/portswigger_labs.json.

Thank you very much for your collaboration @cristobalvch
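Both requests above could be addressed with a short script: creating the `logs` directory at runtime (an alternative to committing an empty folder with a `.gitkeep`), and rendering `portswigger_labs.json` as a Markdown table. This is only a sketch; the `name`/`category`/`url` keys are assumptions, since the actual schema of `portswigger_labs.json` is not shown here.

```python
import json
import os


def ensure_logs_dir(path: str = "logs") -> None:
    # Create the logs directory before the benchmark writes to it;
    # exist_ok avoids an error if the folder is already present.
    os.makedirs(path, exist_ok=True)


def labs_to_markdown_table(labs: list[dict]) -> str:
    """Render lab entries as a Markdown table.

    Assumes each entry carries 'name', 'category', and 'url' keys;
    the real schema of portswigger_labs.json may differ.
    """
    rows = ["| Lab | Category | URL |", "| --- | --- | --- |"]
    for lab in labs:
        rows.append(f"| {lab['name']} | {lab['category']} | {lab['url']} |")
    return "\n".join(rows)


if __name__ == "__main__":
    ensure_logs_dir()
    # Path taken from the review comment above; load the real data here:
    with open("benchmarks/prompt-bench/utils/portswigger_labs.json") as f:
        labs = json.load(f)
    print(labs_to_markdown_table(labs))
```

The table output can be pasted directly into the README, or regenerated whenever the JSON file changes.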

@cristobalvch
Contributor Author

cristobalvch commented Aug 22, 2025

I'm on it!

@vmayoral
Member

@cristobalvch ping us whenever this is ready for another review and thanks for the contrib!

@cristobalvch
Contributor Author

Yes, thanks for waiting! It's almost complete. I just have to re-run some evaluations to add the results section with better performance metrics :)
