Scattered Forest Search: Smart Code Space Optimization and Test-time Scaling with LLMs

This is the official repository for the paper Scattered Forest Search: Smart Code Space Optimization and Test-time Scaling with LLMs.

Setup

  1. Clone this repository and cd into it.
cd sfs
  2. In sfs, create and activate a conda environment with Python 3.12.
conda create --name codespace-opt python=3.12
conda activate codespace-opt
  3. Install the requirements in the codespace-opt environment using pip.
pip install -r requirements.txt
  4. Install PyTorch for your system configuration, following https://pytorch.org/get-started/locally/

  5. Add your OpenAI API key.

export OPENAI_API_KEY='your-api-key-here'

You can also add this to your .bashrc or .bash_profile file as follows:

echo "export OPENAI_API_KEY='your-api-key-here'" >> ~/.bashrc
source ~/.bashrc

This way you don't have to set the key every time you open a new terminal.
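
Optionally, as a quick sanity check after the steps above, you can confirm from Python that PyTorch imports and that the API key is visible to new processes. This is a minimal sketch for convenience, not part of the repository:

import os
import torch

# Confirm PyTorch is installed and report whether a GPU is visible.
print("torch", torch.__version__, "| CUDA available:", torch.cuda.is_available())

# Confirm the OpenAI key is exported in this shell; the value itself is not printed.
print("OPENAI_API_KEY set:", bool(os.environ.get("OPENAI_API_KEY")))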

Running the application

The application can be run using the following command from the sfs directory:

python run.py

Then open a browser and go to http://127.0.0.1:5000 to access the application. See the website for more details and a video tutorial.
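
If the page does not load, a small optional check from Python can tell you whether the server is reachable; this assumes the default host and port shown above:

import urllib.request

# Request the local app's root page; any non-error response means the server is up.
try:
    with urllib.request.urlopen("http://127.0.0.1:5000", timeout=5) as resp:
        print("Application responded with HTTP", resp.status)
except Exception as exc:
    print("Could not reach the application:", exc)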

Running experiments

Experiments can be run from the command line using the following command:

python -m src.main run_name=<run_name> solver=<solver> hydra.verbose=warning solver.params.<parameter>=<value> paths.problem_set_path=<path>

Example:

python -m src.main run_name=sfs_0 solver=synthesis hydra.verbose=warning solver.params.strategy_library_name=mcts solver.params.num_seeds=3 solver.params.max_iters=7 paths.problem_set_path="data/original_problems/humaneval-py_hardest50.jsonl"

There are also scripts set up to batch-run the experiments from the paper. These are located in the src/scripts directory and require Slurm to be set up on your system.
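
If Slurm is not available, a small driver script can launch runs sequentially by shelling out to the src.main entry point. The sketch below reuses the override names from the example command above; the run_name pattern and the seed values are placeholders only:

import subprocess

# Sweep over a couple of seed counts by invoking the documented CLI directly.
for num_seeds in [1, 3]:
    cmd = [
        "python", "-m", "src.main",
        f"run_name=sfs_seeds{num_seeds}",
        "solver=synthesis",
        "hydra.verbose=warning",
        "solver.params.strategy_library_name=mcts",
        f"solver.params.num_seeds={num_seeds}",
        "solver.params.max_iters=7",
        "paths.problem_set_path=data/original_problems/humaneval-py_hardest50.jsonl",
    ]
    subprocess.run(cmd, check=True)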

Experiment analysis

You can analyze the results of the experiments by following the instructions in the notebooks/data_analysis.ipynb notebook, which loads the experiment results and generates the plots used in the paper.
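
If you want to glance at the raw outputs before opening the notebook, a schema-agnostic scan like the one below works; note that the outputs directory name and the .jsonl extension are assumptions (Hydra's default output location) and may differ in your configuration:

import json
from pathlib import Path

# Hypothetical location: Hydra's default output directory is ./outputs,
# but your configuration may write results elsewhere.
results_dir = Path("outputs")

# For each JSONL file found, print its path and the keys of its first record,
# without assuming anything else about the result schema.
for path in sorted(results_dir.rglob("*.jsonl")):
    with open(path) as f:
        first = f.readline().strip()
    print(path, "->", sorted(json.loads(first)) if first else [])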

Data preprocessing

The repo contains readily available data for the experiments. However, if you want to preprocess your own data, you can follow the instructions in the notebooks/data_preprocessing.ipynb notebook. This notebook will preprocess the data and save it in the correct format for the experiments.
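
For reference, the problem sets used above (e.g. data/original_problems/humaneval-py_hardest50.jsonl) are JSON Lines files. The snippet below only inspects such a file; the exact schema expected by the experiments is defined in the preprocessing notebook, and this check does not validate it:

import json

# Path taken from the example command in the "Running experiments" section.
path = "data/original_problems/humaneval-py_hardest50.jsonl"

# Print the number of problems and the fields of the first record,
# without assuming any particular schema.
with open(path) as f:
    records = [json.loads(line) for line in f if line.strip()]
print(len(records), "problems; fields of the first record:", sorted(records[0]))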
