Interpretability | Gabriele Sarti

Interpretability

Inseq: An Interpretability Toolkit for Sequence Generation Models

An open-source library to democratize access to model interpretability for sequence generation models

PECoRe: Plausibility Evaluation of Context Usage in Language Models

An interpretability framework to detect and attribute context usage in language models' generations