Inseq: An Interpretability Toolkit for Sequence Generation Models
Inseq is a PyTorch-based, hackable toolkit that democratizes the study of interpretability for sequence generation models. Inseq supports a wide set of models from the 🤗 Transformers library and an ever-growing set of feature attribution methods, leveraging in part the widely-used Captum library. For a quick introduction to common use cases, see the Getting started with Inseq page.
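As a minimal sketch of the workflow above, the snippet below loads a 🤗 Transformers model together with an attribution method and attributes a generation to its input tokens. The model identifier and the choice of `integrated_gradients` are illustrative assumptions; any supported model and method can be used.

```python
import inseq

# Load a 🤗 Transformers model with an attribution method of choice
# (model name and method here are examples, not requirements)
model = inseq.load_model("Helsinki-NLP/opus-mt-en-it", "integrated_gradients")

# Attribute the generated translation to the input tokens
out = model.attribute(input_texts="Hello everyone, welcome to the tutorial!")

# Visualize the attribution map (HTML in notebooks, rich output in the console)
out.show()
```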
Using Inseq, you can produce feature attribution maps that can be saved, reloaded, aggregated, and visualized either as HTML (with Jupyter notebook support) or directly in the console using rich. Beyond simple attribution, Inseq also supports features such as step score extraction, attribution aggregation, and customization of attributed functions for more advanced use cases; a short sketch follows below.
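The following sketch illustrates step score extraction and saving/reloading of attribution maps. The model name, output file name, and the `"probability"` step score are assumptions chosen for illustration.

```python
import inseq

# A small language model and a gradient-based method, for illustration only
model = inseq.load_model("gpt2", "saliency")

# Extract a step score (here, the probability of each generated token)
# alongside the attribution scores
out = model.attribute(
    "Sequence generation models are",
    step_scores=["probability"],
)

# Save the attribution map to disk and reload it later for visualization
out.save("attribution.json")
reloaded = inseq.FeatureAttributionOutput.load("attribution.json")
reloaded.show()
```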
- PECoRe: Plausibility Evaluation of Context Usage in Language Models
- IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation
- Characterizing Linguistic Complexity in Humans and Language Models
- Contrastive Language-Image Pre-training for the Italian Language
- Contrastive Image-Text Pretraining for Italian