Interpretability for Language Models: Current Trends and Applications | Gabriele Sarti
Home
About me
Publications
Blog
Talks
Projects
Activities
CV
Academic CV
Short CV
Tools
Inseq
LangLearn
Scholar Monitor
Communities
AI2S
AISIG
Interpretability for Language Models: Current Trends and Applications
Gabriele Sarti
Natural Language Processing
,
Academic
Project
Project
Slides
Date
May 12, 2026
Event
Invited Lecture, PhD Course on Mechanistic Interpretability of Large Language Models, National PhD in AI for Society, University of Pisa
Location
Online
Pisa, Italy
Natural Language Processing
Interpretability
Language Modeling
Feature Attribution
Retrieval-augmented Generation
Mechanistic Interpretability
Related
Attribution: Tracing Influence to Inputs and Model Components
Interpretability for Language Models: Current Trends and Applications
Scaling Interpretability for LLM Agents
Interpretability for Language Models: Current Trends and Applications
Interpreting Context Usage in Generative Language Models
Cite
×