Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 482 Bytes

File metadata and controls

12 lines (8 loc) · 482 Bytes

Strix Benchmarks

This repository contains benchmark evaluation infrastructure for Strix. It provides standardized evaluation pipelines for testing Strix capabilities across various security tasks.

Available Benchmarks

Benchmark Description Challenges
XBEN XBOW web security CTF challenges 104

Note

We are actively adding more benchmarks to our evaluation suite.