This is the dedicated word-selection pipeline for VLM-3R-DATA.
It does not depend on spar.
The pipeline reads `vsibench_train` and/or `vstibench_train`, normalizes the records, exports question-only artifacts, and runs LLM-based word selection through an OpenAI-compatible endpoint such as vLLM.
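The normalize/export step can be pictured with a minimal sketch. The raw record field names (`id`, `question`) and the output shape below are assumptions for illustration, not the pipeline's real schema:

```python
# Hypothetical sketch of the "normalize + export question-only" step.
# Field names here (id, question, dataset) are assumptions, not the real schema.
def to_question_only(record: dict, dataset: str) -> dict:
    """Reduce a raw record to a question-only row tagged with its source split."""
    return {
        "id": record["id"],
        "dataset": dataset,  # e.g. "vsibench_train" or "vstibench_train"
        "question": record["question"].strip(),
    }

print(to_question_only({"id": "q1", "question": " How many chairs? "}, "vsibench_train"))
# → {'id': 'q1', 'dataset': 'vsibench_train', 'question': 'How many chairs?'}
```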
Outputs are written to:
- artifacts/vlm3r_word_selection/dataset_manifest.json
- artifacts/vlm3r_word_selection/normalized_train.jsonl
- artifacts/vlm3r_word_selection/normalized_summary.json
- artifacts/vlm3r_word_selection/questions_only.jsonl
- artifacts/vlm3r_word_selection/preview_50.json
- artifacts/vlm3r_word_selection/selected_words.jsonl
- artifacts/vlm3r_word_selection/selection_errors.jsonl
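After a run, a quick way to confirm the output directory is complete is to check for the files listed above. A minimal sketch using only the paths from this README:

```python
from pathlib import Path

# File names taken from the artifact list in this README.
EXPECTED = [
    "dataset_manifest.json",
    "normalized_train.jsonl",
    "normalized_summary.json",
    "questions_only.jsonl",
    "preview_50.json",
    "selected_words.jsonl",
    "selection_errors.jsonl",
]

def missing_artifacts(output_dir: str) -> list[str]:
    """Return the expected artifact names that are absent from output_dir."""
    root = Path(output_dir)
    return [name for name in EXPECTED if not (root / name).exists()]

print(missing_artifacts("artifacts/vlm3r_word_selection"))
```

An empty list means the run produced every expected artifact.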
Each row in `selected_words.jsonl` contains:

- `visible_grounded_words`: words that should correspond to directly observable entities or properties in the image/video
- `reasoning_words`: words that are useful for reasoning but are not themselves directly visible entities, such as relations, motion, order, counting, or route constraints
- `selected_words`: backward-compatible union of the two buckets
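The union relationship between the three keys can be sanity-checked per row. A minimal sketch, assuming each key holds a list of strings (the README does not specify the value types):

```python
import json
from pathlib import Path

def check_row(row: dict) -> bool:
    """Return True if selected_words equals the union of the two buckets.

    Assumes each key holds a list of strings; adjust if the real schema differs.
    """
    union = set(row.get("visible_grounded_words", [])) | set(row.get("reasoning_words", []))
    return set(row.get("selected_words", [])) == union

# Example row shaped like the description above (values are illustrative):
row = {
    "visible_grounded_words": ["table", "chair"],
    "reasoning_words": ["between", "count"],
    "selected_words": ["table", "chair", "between", "count"],
}
print(check_row(row))  # → True

# To check a real run (path from this README):
# for line in Path("artifacts/vlm3r_word_selection/selected_words.jsonl").read_text().splitlines():
#     assert check_row(json.loads(line))
```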
Full run:

```bash
python scripts/vlm3r_word_selection_pipeline.py \
  --dataset-root VLM-3R-DATA \
  --model Qwen/Qwen2.5-7B-Instruct \
  --api-base http://127.0.0.1:8000/v1
```

To prepare only:
```bash
python scripts/vlm3r_word_selection_pipeline.py \
  --dataset-root VLM-3R-DATA \
  --model Qwen/Qwen2.5-7B-Instruct \
  --prepare-only
```

If the cluster job should start vLLM itself:
```bash
sbatch hpc/sbatch_vlm3r_word_selection.sh
```

This full-run job uses:

- `partition=boost_usr_prod`
- `qos=normal`
- `time=10:00:00`
- `gpus-per-node=4`
- `cpus-per-task=32`
- `exclusive`
For a short single/debug run:
```bash
sbatch hpc/sbatch_vlm3r_word_selection_single.sh
```

This single-run job uses:

- `partition=boost_usr_prod`
- `qos=boost_qos_dbg`
- `time=00:30:00`
- `gpus-per-node=4`
- `cpus-per-task=32`
- default `LIMIT=50`
If you already have an endpoint running:
```bash
PYTHON_BIN=python \
DATASET_ROOT=/path/to/VLM-3R-DATA \
OUTPUT_DIR=/path/to/output \
MODEL_NAME=Qwen/Qwen2.5-7B-Instruct \
API_BASE=http://127.0.0.1:8000/v1 \
START_VLLM=0 \
bash hpc/run_vlm3r_word_selection.sh
```

- `vsibench_train` and `vstibench_train` are both supported.
- The selector reads from the normalized question text and returns `selected_words` plus a short justification.
- To rerun after a partial job, add `--resume` to the Python command.
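Since the endpoint is OpenAI-compatible, each selection call is a standard chat-completions request. A minimal sketch of building one request payload; the prompt wording and `temperature` value here are assumptions, not the pipeline's actual prompt:

```python
import json

def build_selection_request(question: str, model: str) -> dict:
    """Build a chat-completions payload for one normalized question.

    The system prompt below is illustrative only; the real pipeline's
    prompt may differ.
    """
    return {
        "model": model,
        "temperature": 0.0,
        "messages": [
            {"role": "system",
             "content": "Select visible-grounded words and reasoning words from the question."},
            {"role": "user", "content": question},
        ],
    }

payload = build_selection_request("How many chairs are visible?", "Qwen/Qwen2.5-7B-Instruct")
print(json.dumps(payload, indent=2))
```

The payload would be POSTed to `{API_BASE}/chat/completions`, e.g. `http://127.0.0.1:8000/v1/chat/completions` for a local vLLM server.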