Interpretability for Language Models: Current Trends and Applications | Gabriele Sarti
Interpretability for Language Models: Current Trends and Applications
Gabriele Sarti
Natural Language Processing, Academic
Date: Nov 19, 2025
Event: Invited Lecture, MSc Course on Explainable and Neuro-symbolic AI, University of Trieste
Location: Online, Italy
Natural Language Processing
Interpretability
Language Modeling
Feature Attribution
Retrieval-augmented Generation
Mechanistic Interpretability
Related
Attribution: Tracing Influence to Inputs and Model Components
Interpretability for Language Models: Current Trends and Applications
Interpretability for Language Models: Trends and Applications
Interpreting Context Usage in Generative Language Models