CommitUp is an LLM-based test suite evolution tool for Java projects.
Requirements:
- Python 3.11+
- Java 1.8
- Java 15+ is required for call graph generation (`java-callgraph-1.0-SNAPSHOT-jar-with-dependencies.jar`)
- Maven 3.2.5+
- Git
- uv
Optional (model backend):
- `qwen7b`: local vLLM (`http://localhost:8000/v1`)
- `qwen7b-ollama`: local Ollama + `qwen2.5-coder:latest`
Install:

```shell
uv python install 3.11
uv sync
mkdir -p exp_repos
bash exp_repos/clone_repos.sh
```

Place a local bge-reranker-v2-gemma model directory, for example:

```
<repo_root>/models/bge-reranker-v2-gemma
```
```shell
cp .env.template .env
```

Recommended `.env` (absolute paths):

```
ROOT_PATH="/absolute/path/to/CommitUp"
JAVA_REPO_PATH="/absolute/path/to/CommitUp/exp_repos"
RERANKER_PATH="/absolute/path/to/CommitUp/models/bge-reranker-v2-gemma"
DATASET_PATH="/absolute/path/to/CommitUp/data/benchmark.json"
DEEPSEEK_API_KEY="your_deepseek_api_key"
OPENAI_API_URL="https://your-openai-compatible-endpoint/v1"
OPENAI_API_KEY="your_openai_api_key"
POM_PATH=""
```

Project layout:

```
CommitUp/
│
├── README.md
├── README_EN.md
├── main.py
├── components.py
├── data/
│   ├── benchmark.json
│   └── load_data.py
├── core/
│   ├── agents/
│   ├── env/
│   ├── llms/
│   ├── prompts/
│   └── rag/
├── exp_repos/
│   └── clone_repos.sh
└── run_logs/
```
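The `.env` file is plain `KEY="value"` pairs. As a minimal sketch of how those variables could be read without extra dependencies — `load_env` and `apply_env` are hypothetical helper names, not CommitUp's actual loading code (which may use a library such as python-dotenv):

```python
import os


def load_env(path: str) -> dict[str, str]:
    """Parse simple KEY="value" lines from a .env-style file (illustrative only)."""
    env: dict[str, str] = {}
    with open(path, encoding="utf-8") as fh:
        for raw in fh:
            line = raw.strip()
            # Skip blank lines, comments, and anything without a KEY=VALUE shape.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            env[key.strip()] = value.strip().strip('"')
    return env


def apply_env(path: str) -> None:
    """Export parsed values, without overwriting variables already set."""
    for key, value in load_env(path).items():
        os.environ.setdefault(key, value)
```

A loader like this keeps the recommended absolute paths available to any subprocess the run spawns.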
Supported models:
- `deepseek-chat`
- `gpt-4o`
- `qwen7b`
- `qwen7b-ollama`
Run one sample:

```shell
uv run -m main --model deepseek-chat --start-index 0 --end-index 1 --temperature 0
```

Run full benchmark:

```shell
uv run -m main --model deepseek-chat --start-index 0 --end-index 248 --temperature 0
```

Use OpenAI-compatible backend:

```shell
uv run -m main --model gpt-4o --start-index 0 --end-index 248 --temperature 0
```

Use local vLLM:

```shell
vllm serve Qwen/Qwen2.5-Coder-7B-Instruct --port 8000
uv run -m main --model qwen7b --start-index 0 --end-index 248 --temperature 0
```

Output files:

```
run_logs/benchmark__<uuid>/<case_index>__<instance_id>/result.json
```
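Given that layout, the per-case results can be gathered with a short script. This is a sketch: `collect_results` is a hypothetical helper name, the glob pattern follows the directory layout above, and the contents of each `result.json` are not assumed here:

```python
import json
from pathlib import Path


def collect_results(run_logs: str) -> list[dict]:
    """Gather every result.json under run_logs/benchmark__<uuid>/<case>__<id>/.

    The directory pattern comes from the output layout above; the JSON
    schema inside each file is passed through untouched.
    """
    results = []
    for path in sorted(Path(run_logs).glob("benchmark__*/*/result.json")):
        with path.open(encoding="utf-8") as fh:
            results.append({"path": str(path), "data": json.load(fh)})
    return results
```

This makes it easy to compare runs, since each invocation writes under a fresh `benchmark__<uuid>` directory.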
Notes:
- The run process executes `git reset --hard` and `git checkout -f <commit>` inside `exp_repos`.
- Do not keep uncommitted work in repositories under `exp_repos`.
- Use absolute paths for `JAVA_REPO_PATH`, `DATASET_PATH`, and `RERANKER_PATH`.
- Always run one sample first before the full benchmark.
- Full runs are long; chunked execution is recommended.
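A chunked run can be sketched as a loop over smaller `--start-index`/`--end-index` windows. The helper name `print_chunks`, the chunk size of 50, and the choice of `deepseek-chat` are illustrative; the script only prints the commands, so they can be reviewed first or piped to `bash`:

```shell
#!/usr/bin/env bash
# Emit one benchmark command per index window (sketch; chunk size is arbitrary).
print_chunks() {
  local total=$1 chunk=$2 start end
  for ((start = 0; start < total; start += chunk)); do
    # Clamp the last window to the total number of cases.
    end=$(( start + chunk < total ? start + chunk : total ))
    echo "uv run -m main --model deepseek-chat --start-index $start --end-index $end --temperature 0"
  done
}

# Cover the full 0-248 range in windows of 50.
print_chunks 248 50
```

Running the printed commands one at a time also limits how much progress is lost if a long run is interrupted.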