Popular repositories Loading
-
inspect_ai
inspect_ai PublicInspect: A framework for large language model evaluations
-
-
control-arena
control-arena PublicControlArena is a suite of realistic settings, mimicking complex deployment environments, for running control evaluations. This is an alpha release; we welcome feedback.
-
as-evaluation-standard
as-evaluation-standard Public templateA repository that holds templates, examples, and tests to help external parties submit tasks to AISI that conform with the Autonomous Systems Team's Task Standard
-
inspect_k8s_sandbox
inspect_k8s_sandbox PublicA Kubernetes sandbox environment for use with inspect_ai
-
Repositories
- dsit-dvs-register Public
UKGovernmentBEIS/dsit-dvs-register’s past year of commit activity - dsit-dvs-register-admin Public
UKGovernmentBEIS/dsit-dvs-register-admin’s past year of commit activity - control-arena Public
ControlArena is a suite of realistic settings, mimicking complex deployment environments, for running control evaluations. This is an alpha release; we welcome feedback.
UKGovernmentBEIS/control-arena’s past year of commit activity - help-to-heat-GBIS Public
UKGovernmentBEIS/help-to-heat-GBIS’s past year of commit activity - transparency-db-admin-portal Public
UKGovernmentBEIS/transparency-db-admin-portal’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…