Memorisation localisation for natural language classification tasks

Prepare by installing the conda environment using conda env create --file=env.yaml, which will install the environment memloc in python 3.9.
Add folders logs, checkpoints and subfolders to the root folder by running bash setup_folders.sh.
Train models by running run_all.sh training from within src/submit_scripts/. Model checkpoints will be stored to the checkpoints/<dataset> folder, and during training / analysis progress information will be saved to logs/<analysis_type>.
Subsequently, analyses can be conducted by running run_all.sh <analysis> from within src/submit_scripts where analysis is one of swapping | retraining | gradients | probing | centroid_analysis.
Individual analyses can be visualised using the corresponding notebooks (visualise_layer_swapping.ipynb, visualise_layer_retraining.ipynb, visualise_gradients.py, visualise_probing.ipynb), that start with cells for the control setup (section 3.2), followed by cells the main results analysis (section 4).
Centroid analysis can be performed using visualise_centroid_analysis.ipynb, after first executing visualise_mmaps.ipynb for all models / datasets, to compute the generalisation scores used in the centroid analysis correlation analysis.
Afterwards, summary visualisations can be computed using summarising_visualisations.ipynb.
For the appendix experiments using the 1.3B models, execute run_all.sh <mode> from within src/submit_scripts_big/ first using training, followed by swapping and centroid_analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
src		src
visualisations		visualisations
README.md		README.md
env.yml		env.yml
setup_folders.sh		setup_folders.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Memorisation localisation for natural language classification tasks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Memorisation localisation for natural language classification tasks

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages