Skip to content

The official repo for the Dialz Python library - a toolkit for steering vector research.

License

Notifications You must be signed in to change notification settings

cardiffnlp/dialz

Repository files navigation

PyPI license

Dialz: A Python Toolkit for Steering Vectors

Docs: https://cardiffnlp.github.io/dialz/

About

Steering vectors allow users to modify activations at inference time to amplify or weaken a 'concept', e.g. honesty or positivity.

Dialz supports a diverse set of tasks, including creating contrastive pair datasets, computing and applying steering vectors, and visualizations.

A basic tutorial can be found here.

Installation

pip install dialz

Check out the full documentation for usage information.

Contributing

Any contributions to improve this project are welcome! Please open an issue or pull request in this repo with any changes you have.

License

This code is released under a MIT license.

About

The official repo for the Dialz Python library - a toolkit for steering vector research.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published