Interpretability for Language Models: Current Trends and Applications | Gabriele Sarti
Home
About me
Publications
Blog
Talks
Projects
Activities
CV
Academic CV
Short CV
Communities
AI2S
AISIG
Interpretability for Language Models: Current Trends and Applications
Gabriele Sarti
Natural Language Processing
,
Academic
Project
Project
Slides
Date
Nov 19, 2025
Event
Invited Lecture, MSc Course on Explainable and Neuro-symbolic AI, University of Trieste
Location
Online
Italy
Natural Language Processing
Interpretability
Language Modeling
Feature Attribution
Retrieval-augmented Generation
Mechanistic Interpretability
Related
Interpretability for Language Models: Current Trends and Applications
Interpreting Context Usage in Generative Language Models
Interpreting Context Usage in Generative Language Models
QE4PE: Word-level Quality Estimation for Human Post-Editing
Interpreting Context Usage in Generative Language Models with Inseq, PECoRe and MIRAGE
Cite
×