Interpretability for Language Models: Trends and Applications | Gabriele Sarti
Home
About me
Publications
Blog
Talks
Projects
Activities
CV
Academic CV
Short CV
Communities
AI2S
AISIG
Interpretability for Language Models: Trends and Applications
Gabriele Sarti
Natural Language Processing
,
Academic
Code
Project
Project
Slides
Date
Dec 18, 2025
Event
DEI Seminar
Location
Università di Padova, Italy
Padova, Veneto, Italy
Natural Language Processing
Interpretability
Sequence-to-sequence
Language Modeling
Feature Attribution
Retrieval-augmented Generation
Concept-based Interpretability
Sparse Autoencoders
Related
Interpretability for Language Models: Current Trends and Applications
Interpreting Context Usage in Generative Language Models
Interpreting Context Usage in Generative Language Models
QE4PE: Word-level Quality Estimation for Human Post-Editing
Interpreting Context Usage in Generative Language Models with Inseq, PECoRe and MIRAGE
Cite
×