This library allows sampling from a language model constrained by a context-free grammar.
The library is guaranteed to work with Python 3.11; it has not been tested with other versions. It also requires several Python packages:
```
pip install torch transformers gpustat numpy accelerate llguidance xgrammar scipy matplotlib
```
To run the sampling task, use the following command:
```
python run_task.py grammar_file prompt_file sample_style model
```
where:
- `grammar_file` is a file containing a context-free grammar in a format understood by the `llguidance` library (both `ebnf` and `lark` formats are supported),
- `prompt_file` is a text file containing the prompt,
- `sample_style` is one of the following methods:
  - `rs` (Rejection Sampling),
  - `ars` (Adaptive Rejection Sampling),
  - `rsft` (Rejection Sampling with constrained First Token),
  - `cars` (Constrained Adaptive Rejection Sampling),
  - `prefix` (MCMC-Prefix),
  - `priority` (MCMC-Priority),
  - `restart` (MCMC-Restart),
- `model` is a number between 0 and 3, specifying the model to be used.
The MCMC sampling methods are described in the paper: Constrained Sampling for Language Models Should Be Easy: An MCMC Perspective.
- 0: `hsultanbey/codegen350multi_finetuned` (a small model, suitable for local testing on machines without a GPU)
- 1: `meta-llama/Llama-3.1-8B-Instruct`
- 2: `Qwen/Qwen2.5-7B-Instruct`
- 3: `Qwen/Qwen2.5-14B-Instruct`
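As an illustration, the following snippet creates a minimal grammar and prompt file and shows how the command could then be invoked. The file names and grammar contents are hypothetical, not required by the library:

```shell
# A tiny lark grammar that only admits "yes" or "no".
cat > grammar.lark <<'EOF'
start: "yes" | "no"
EOF

# A one-line prompt file.
cat > prompt.txt <<'EOF'
Answer with yes or no: is the sky blue?
EOF

# Constrained adaptive rejection sampling with the small model (0):
# python run_task.py grammar.lark prompt.txt cars 0
```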
The program outputs the generated sequences to the standard output, along with basic information.
In a more structured form, the generated sequences are also saved to a JSON file located in the `runs_log` folder.
The subfolder name consists of three parts:
- a part derived from the grammar and prompt file locations,
- a hash of the grammar and prompt,
- the model number.
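The exact naming scheme lives in the library's code; the following is only a hypothetical sketch of how the three parts listed above could be assembled into a subfolder name. The function name, hash choice, and separators are assumptions:

```python
import hashlib
import os

def log_subfolder(grammar_path: str, prompt_path: str, model: int) -> str:
    """Illustrative only: the library's actual naming scheme may differ."""
    # Part derived from the grammar and prompt file locations.
    base = "_".join(
        os.path.splitext(os.path.basename(p))[0]
        for p in (grammar_path, prompt_path)
    )
    # Hash of the grammar and prompt contents.
    with open(grammar_path, "rb") as g, open(prompt_path, "rb") as p:
        digest = hashlib.sha256(g.read() + p.read()).hexdigest()[:8]
    # Model number as the final component.
    return f"{base}_{digest}_{model}"
```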
The following environment variables can be used to configure the program:
- `TCFG_LOG_LEVEL`: set to `INFO` or `DEBUG` for more detailed output.
- `HF_HOME`: specifies the path to the folder where language models will be stored.
- `CUDA_VISIBLE_DEVICES`: specifies the GPU number on which the calculations will be run.
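For example, a run with verbose logging, a custom model cache, and a specific GPU might be set up as follows (the cache path is hypothetical):

```shell
# Store downloaded models on a large data disk (path is illustrative).
export HF_HOME=/data/hf_cache
# Enable detailed logging.
export TCFG_LOG_LEVEL=DEBUG
# Run on GPU 1.
export CUDA_VISIBLE_DEVICES=1
# python run_task.py grammar_file prompt_file cars 1
```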
Several parameters can be set within the run_task.py file, including:
- `max_new_tokens`: the maximum number of tokens to generate in a sampled sequence.
- `n_samples`: the number of sequences to generate.
- `n_steps`:
  - for the `rs`, `ars`, `rsft`, and `cars` styles, the program stops after `n_steps` calls to the LLM, even if `n_samples` sequences have not been produced;
  - for the MCMC styles, `n_steps` represents the number of steps `k` (as described in the paper).
- `torch_dtype`: the floating-point data type used for the LLM computations.

It is also easy to add more models from Hugging Face. However, for the MCMC styles, they must also be listed in the `mcmc/lib.py` file.
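In `run_task.py`, these parameters might be set roughly as follows. The values shown are placeholders for illustration, not the library's defaults:

```python
import torch

# Placeholder values: the actual defaults in run_task.py may differ.
max_new_tokens = 128          # cap on tokens generated per sampled sequence
n_samples = 10                # number of sequences to generate
n_steps = 1000                # LLM calls (rs/ars/rsft/cars) or MCMC steps k
torch_dtype = torch.bfloat16  # floating-point dtype for LLM computations
```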