This directory provides the ability to benchmark agents (e.g., those built using Autogen) using AgBench. Use the instructions below to prepare your environment for benchmarking. Once done, proceed to the relevant benchmark directory (e.g., benchmarks/GAIA) for further scenario-specific instructions.
- Install Docker Desktop. Restart your machine after installation, then open Docker Desktop and go to Settings > Resources > WSL Integration. Under "Enable integration with additional distros", select Ubuntu.
- Clone autogen and export AUTOGEN_REPO_BASE. This environment variable enables the Docker containers to use the correct version of the agents.

    git clone git@github.com:microsoft/autogen.git
    export AUTOGEN_REPO_BASE=<path_to_autogen>
- Install agbench. AgBench is currently a tool in the Autogen repo.

    cd autogen/python/packages/agbench
    pip install -e .
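
Before moving on to a scenario directory, it can help to confirm that the three steps above took effect. The following is a minimal sketch, not part of AgBench: the function names check_command, check_repo_base, and check_setup are hypothetical, and it assumes that the editable install puts an agbench command on your PATH.

```shell
#!/usr/bin/env bash
# Sanity-check sketch for the setup steps above.
# Hypothetical helper functions; none of these ship with AgBench.

# Report whether a command is available on PATH.
check_command() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "ok: $1 found"
    else
        echo "missing: $1 not found on PATH"
        return 1
    fi
}

# Report whether AUTOGEN_REPO_BASE is set and points at an existing directory.
check_repo_base() {
    if [ -z "${AUTOGEN_REPO_BASE:-}" ]; then
        echo "missing: AUTOGEN_REPO_BASE is not set"
        return 1
    elif [ ! -d "$AUTOGEN_REPO_BASE" ]; then
        echo "missing: AUTOGEN_REPO_BASE is not a directory: $AUTOGEN_REPO_BASE"
        return 1
    fi
    echo "ok: AUTOGEN_REPO_BASE=$AUTOGEN_REPO_BASE"
}

# Run all checks and return non-zero if any of them failed.
check_setup() {
    setup_status=0
    check_command docker || setup_status=1
    check_repo_base || setup_status=1
    # Assumption: "pip install -e ." exposes an "agbench" command.
    check_command agbench || setup_status=1
    return "$setup_status"
}
```

To use it, save the functions to a file, source that file in the shell where you exported AUTOGEN_REPO_BASE, and run check_setup. Each line beginning with "missing:" points back to the step that needs attention.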