Dialz: A Python Toolkit for Steering Vectors

About

Steering vectors allow users to modify activations at inference time to amplify or weaken a 'concept', e.g. honesty or positivity.
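As a rough illustration of the idea, and not Dialz's own API, the sketch below adds a scaled steering vector to a single hidden-state vector. The function name `apply_steering` and the toy numbers are hypothetical. In a real model the activation would be read from a chosen layer during the forward pass.

```python
from typing import List


def apply_steering(
    activation: List[float], steering_vector: List[float], scalar: float
) -> List[float]:
    """Return activation + scalar * steering_vector, element-wise.

    Hypothetical helper for illustration only; not part of Dialz.
    """
    if len(activation) != len(steering_vector):
        raise ValueError("activation and steering vector must have the same length")
    return [a + scalar * v for a, v in zip(activation, steering_vector)]


# Toy hidden state and a toy direction standing in for a concept such as "honesty".
hidden = [0.5, -1.0, 2.0]
direction = [1.0, 0.0, -1.0]

# A positive scalar pushes the activation toward the concept; a negative one pushes away.
amplified = apply_steering(hidden, direction, scalar=2.0)
weakened = apply_steering(hidden, direction, scalar=-2.0)
```

The sign and size of the scalar set the direction and strength of the shift, which is what "amplify or weaken" refers to above.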

Dialz supports a diverse set of tasks, including creating contrastive pair datasets, computing and applying steering vectors, and producing visualizations.
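To make the first two tasks concrete, here is a minimal sketch in plain Python. It assumes a mean-difference approach, which is one common way to derive a steering vector and is not necessarily the method Dialz uses. The helpers `make_contrastive_pairs` and `mean_difference`, the persona strings, and the activation values are all hypothetical stand-ins; real activations would be collected from a model.

```python
from typing import List, Sequence, Tuple


def make_contrastive_pairs(
    prompts: Sequence[str], positive_prefix: str, negative_prefix: str
) -> List[Tuple[str, str]]:
    """Pair each prompt with a positive and a negative framing (hypothetical helper)."""
    return [(f"{positive_prefix} {p}", f"{negative_prefix} {p}") for p in prompts]


def mean_difference(
    pos_acts: Sequence[Sequence[float]], neg_acts: Sequence[Sequence[float]]
) -> List[float]:
    """Average the per-pair difference (positive minus negative) in each dimension."""
    if not pos_acts or len(pos_acts) != len(neg_acts):
        raise ValueError("need the same non-zero number of positive and negative activations")
    n = len(pos_acts)
    dim = len(pos_acts[0])
    return [
        sum(p[i] - q[i] for p, q in zip(pos_acts, neg_acts)) / n
        for i in range(dim)
    ]


pairs = make_contrastive_pairs(
    ["Tell me about your day.", "Describe the weather."],
    positive_prefix="Act as an honest person.",
    negative_prefix="Act as a dishonest person.",
)

# Made-up activations, one vector per prompt in each half of the pairs.
positive_activations = [[1.0, 2.0], [3.0, 4.0]]
negative_activations = [[0.0, 1.0], [1.0, 1.0]]

steering_vector = mean_difference(positive_activations, negative_activations)
```

The resulting vector points from the negative framing toward the positive one, so adding a multiple of it to an activation nudges the model toward the positive concept.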

A basic tutorial can be found here.

pip install dialz

Check out the full documentation for usage information.

Contributions that improve this project are welcome! Please open an issue or pull request in this repo with your proposed changes.

This code is released under an MIT license.
