This repo contains the code and data for our ICML 2024 paper, PACE:
Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models
Hengyi Wang*, Shiwei Tan*, Hao Wang
[Paper] [ICML Website]
and our EMNLP 2024 Findings paper, VALC:
Variational Language Concepts for Interpreting Foundation Language Models
Hengyi Wang, Shiwei Tan, Zhiqing Hong, Desheng Zhang, Hao Wang
[Paper] [ACL Website]
We propose five desiderata for explaining vision foundation models such as ViTs (faithfulness, stability, sparsity, multi-level structure, and parsimony) and show that current methods fail to meet all of these criteria comprehensively. Rather than relying on sparse autoencoders (SAEs), we introduce a variational Bayesian explanation framework, dubbed ProbAbilistic Concept Explainers (PACE), which models the distributions of patch embeddings to provide trustworthy post-hoc conceptual explanations. PACE provides dataset-, image-, and patch-level explanations for ViTs and satisfies all five desiderata in a unified framework.
PACE is compatible with arbitrary vision transformers.
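For intuition, below is a minimal sketch of the underlying idea, not the official PACE implementation: extract per-patch embeddings from any pretrained ViT, then fit a probabilistic model over them whose components play the role of concepts. The `google/vit-base-patch16-224` checkpoint, the `GaussianMixture` stand-in for PACE's variational posterior, and the input file `example.jpg` are all illustrative assumptions.

```python
# Minimal sketch, NOT the official PACE implementation: a Gaussian mixture
# stands in for PACE's variational posterior over patch embeddings, and its
# mixture components play the role of concepts.
import torch
from PIL import Image
from sklearn.mixture import GaussianMixture
from transformers import ViTImageProcessor, ViTModel

# Any ViT works; this checkpoint is an illustrative choice.
processor = ViTImageProcessor.from_pretrained("google/vit-base-patch16-224")
vit = ViTModel.from_pretrained("google/vit-base-patch16-224").eval()

image = Image.open("example.jpg")  # hypothetical input image
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    # last_hidden_state: (1, 1 + num_patches, hidden_dim); drop the [CLS] token
    patch_embeddings = vit(**inputs).last_hidden_state[0, 1:].numpy()

# Fit a mixture whose components act as "concepts": the posterior
# responsibilities give patch-level concept weights, and averaging them
# yields a rough image-level concept summary.
gmm = GaussianMixture(n_components=5, covariance_type="diag", random_state=0)
gmm.fit(patch_embeddings)
patch_concepts = gmm.predict_proba(patch_embeddings)  # (num_patches, 5)
image_concepts = patch_concepts.mean(axis=0)          # (5,)
print("image-level concept weights:", image_concepts)
```

In PACE itself the posterior is learned variationally and tied to the ViT's predictions, which is what yields faithfulness and stability; the mixture above only illustrates the multi-level (patch- and image-level) structure of the explanations.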
Below are some sample concepts automatically discovered by PACE, without the need for concept annotations during training.
Figure 1. Above are some sample concepts discovered by PACE in the COLOR dataset. See Figure 3 of our paper for details on the COLOR dataset.
Figure 2. Above are some sample concepts discovered by PACE in the Oxford Flower dataset.
To set up the environment:

```bash
conda env create -f environment_PACE.yml
conda activate PACE
```

To generate the data, train the ViT backbone, then train and evaluate PACE:

```bash
cd src
python generate_data.py
bash ./train_ViT.sh
bash ./train_PACE.sh
bash ./eval_PACE.sh
```

Code for VALC: Coming Soon!
If you find our work helpful, please cite:

```bibtex
@inproceedings{PACE,
  title={Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models},
  author={Hengyi Wang and
          Shiwei Tan and
          Hao Wang},
  booktitle={International Conference on Machine Learning},
  year={2024}
}
```
```bibtex
@inproceedings{VALC,
  title={Variational Language Concepts for Interpreting Foundation Language Models},
  author={Hengyi Wang and
          Shiwei Tan and
          Zhiqing Hong and
          Desheng Zhang and
          Hao Wang},
  booktitle={Findings of the Association for Computational Linguistics: EMNLP 2024},
  year={2024}
}
```
