🎌 Frame Representation Hypothesis

Authors: Pedro Valois, Lincon Souza, Erica Kido Shimomoto, Kazuhiro Fukui

The Frame Representation Hypothesis is a robust framework for understanding and controlling LLMs. We use WordNet to generate concepts that can both guide the model text generation and expose biases or vulnerabilities.

💡 Highlights

♻️ Capable of dealing with multi-token words.
🎧 Can use OMW 50M word dataset to build 100,000 concepts.
💪 Tested on Llama 3.1, Gemma 2 and Phi 3 ensuring high-quality responses.
🚀 Very fast and low memory cost. Able to compute all concepts in less than a second and fit both Llama 3.1 8B Instruct and Concepts in a RTX 4090 GPU.

Install

Clone this repository.

git clone https://github.com/Pedrexus/frame-representation-hypothesis
cd frame-representation-hypothesis

Install packages.

pip install -U pip
pip install uv
uv sync

We also provide a Docker image (you may need to update the CUDA version to yours)

Add Environment Variables

Create a .env file following the .env.example file.
You will need a Hugging Face Acces Token to Download models. Here is how to obtain it: https://huggingface.co/docs/hub/en/security-tokens
You will also need to ask for permission to download each model in models.yaml: https://huggingface.co/docs/hub/en/models-gated - hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 - meta-llama/Meta-Llama-3.1-8B - ...

Download Models

Run 01_START_HERE.ipynb to download all models.

Quick Start

Each experiment in the paper is in one of the jupyter notebooks starting from 02.

LICENSE

Our code is released under the MIT License.

Citation

If you have any questions, please feel free to submit an issue or contact pedro@cvlab.cs.tsukuba.ac.jp.

If our work is useful for you, please cite as:

@article{valois2025framerepresentationhypothesis,
      title={Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation},
      author={Pedro H. V. Valois and Lincon S. Souza and Erica K. Shimomoto and Kazuhiro Fukui},
      journal = {Transactions of the Association for Computational Linguistics},
      year={2025},
      url={https://arxiv.org/abs/2412.07334},
}

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.devcontainer		.devcontainer
cache		cache
frames		frames
images		images
resources		resources
static		static
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
01_START_HERE.ipynb		01_START_HERE.ipynb
02_tokenization_frames.ipynb		02_tokenization_frames.ipynb
03_concept_vs_word_frames_relationship.ipynb		03_concept_vs_word_frames_relationship.ipynb
04_model_family_concept_space_comparison.ipynb		04_model_family_concept_space_comparison.ipynb
05_guided_generation_samples.ipynb		05_guided_generation_samples.ipynb
06_guided_generation_top_k_ablation.ipynb		06_guided_generation_top_k_ablation.ipynb
07_guided_generation_language_ablation.ipynb		07_guided_generation_language_ablation.ipynb
08_guided_generation_model_comparison.ipynb		08_guided_generation_model_comparison.ipynb
09_guided_generation_language_topk_ablation.ipynb		09_guided_generation_language_topk_ablation.ipynb
10_multilang_safebench_ablation.ipynb		10_multilang_safebench_ablation.ipynb
11_token_superposition_analysis.ipynb		11_token_superposition_analysis.ipynb
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
index.html		index.html
makefile		makefile
models.yaml		models.yaml
plotting.ipynb		plotting.ipynb
pyproject.toml		pyproject.toml
qsub.sh		qsub.sh
results.shelf.db		results.shelf.db
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎌 Frame Representation Hypothesis

💡 Highlights

Install

Quick Start

LICENSE

Citation

About

Uh oh!

Releases

Packages

Languages

License

moringfix/frame-representation-hypothesis

Folders and files

Latest commit

History

Repository files navigation

🎌 Frame Representation Hypothesis

💡 Highlights

Install

Quick Start

LICENSE

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages