Interpretability for Language Models: Current Trends and Applications | Gabriele Sarti
Home
About me
Publications
Blog
Talks
Projects
Activities
CV
Communities
AI2S
AISIG
Interpretability for Language Models: Current Trends and Applications
Gabriele Sarti
Natural Language Processing
,
Academic
Project
Project
Slides
Date
Nov 5, 2024
Event
Seminar, PhD Course on XAI, Sapienza University of Rome
Location
Online
Natural Language Processing
Interpretability
Sequence-to-sequence
Language Modeling
Feature Attribution
Related
Interpreting Context Usage in Generative Language Models with Inseq and PECoRe
Post-hoc Interpretability for Generative Language Models: Explaining Context Usage in Transformers
Post-hoc Interpretability for Language Models
Post-hoc Interpretability for NLG & Inseq: an Interpretability Toolkit for Sequence Generation Models
Explaining Neural Language Models from Internal Representations to Model Predictions
Cite
×