Interpretability | Gabriele Sarti

Interpretability

Attributing Context Usage in Language Models

An interpretability framework for detecting and attributing context usage in language model generations

An open-source library to democratize access to model interpretability for sequence generation models