LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models

This repository contains the official implementation of LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models.

Overview

State space models (SSMs), particularly Mamba, have emerged as efficient alternatives to transformers for long-context sequence modeling. While recent efforts provide insights into Mamba's internal mechanisms, they don't explicitly decompose token-wise contributions. LaTIM introduces a novel token-level decomposition method for both Mamba-1 and Mamba-2 that enables fine-grained interpretability.

Features

Implementation of LaTIM decomposition method for Mamba models
Support for both Mamba-1 and Mamba-2 architectures
Evaluation tools for machine translation, copying, and retrieval-based tasks
Visualization utilities for token interaction patterns

Installation

pip install -r requirements.txt

The accompanying mamba_ssm (fork) package is available here.

Models

The codebase includes implementations for:

Mamba Language Models
Mamba Machine Translation Models
Mamba Copy Task Models

Interpretability

The src/models/mamba/interpretability_utils.py file contains the code required for interpreting and visualizing token-to-token interactions in Mamba models.

Users can follow the different notebooks (src/notebooks/) to see usage examples.

Citation

If you use this code in your research, please cite:

@misc{pitorro2025latimmeasuringlatenttokentotoken,
title={LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models},
author={Hugo Pitorro and Marcos Treviso},
year={2025},
eprint={2502.15612},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2502.15612},
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
configs		configs
gold-align		gold-align
notebooks		notebooks
scripts		scripts
src		src
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models

Overview

Features

Installation

Models

Interpretability

Citation

About

Releases

Packages

Languages

License

deep-spin/latim

Folders and files

Latest commit

History

Repository files navigation

LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models

Overview

Features

Installation

Models

Interpretability

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages