Interpretability for Language Models: Current Trends and Applications | Gabriele Sarti
Home
About me
Publications
Blog
Talks
Projects
Activities
CV
Communities
AI2S
AISIG
Interpretability for Language Models: Current Trends and Applications
Gabriele Sarti
Natural Language Processing
,
Academic
Code
Project
Project
Slides
Date
Mar 24, 2025
Event
Seminar, MSc Course on Trustworthy and Explainable AI, University of Groningen
Location
Bernoulli Institute, Faculty of Science and Engineering, University of Groningen
Groningen, Netherlands
Natural Language Processing
Interpretability
Sequence-to-sequence
Language Modeling
Feature Attribution
Retrieval-augmented Generation
Related
Interpreting Context Usage in Generative Language Models
Interpreting Context Usage in Generative Language Models with Inseq, PECoRe and MIRAGE
Interpretability for Language Models: Current Trends and Applications
Interpreting Context Usage in Generative Language Models with Inseq and PECoRe
Post-hoc Interpretability for Generative Language Models: Explaining Context Usage in Transformers
Cite
×