Gabriele Sarti

PhD in Natural Language Processing

CLCG, University of Groningen

About me

Welcome to my website! 👋 I am a PhD student at the Computational Linguistics Group of the University of Groningen and a member of the InDeep consortium, working on user-centric interpretability for neural machine translation. I am also the main developer of the Inseq library. My supervisors are Arianna Bisazza, Malvina Nissim and Grzegorz Chrupała.

Previously, I was a research intern at Amazon Translate NYC, a research scientist at Aindo, a Data Science MSc student at the University of Trieste and a co-founder of the AI Student Society.

My research focuses on interpretability for generative language models, with a particular interest in benefits for end users and the use of human behavioral signals. I am also interested in causality and open source collaboration.

Your (anonymous) feedback is always welcome! 🙂

Interests

Conditional Text Generation
Interpretability for Deep Learning
Behavioral Data for NLP
Causality and Uncertainty Estimation

Education

PhD in Natural Language Processing
University of Groningen (NL), 2021 - Ongoing

MSc. in Data Science and Scientific Computing
University of Trieste & SISSA (IT), 2018 - 2020

DEC in Software Management
Cégep de Saint-Hyacinthe (CA), 2015 - 2018

Experience

Applied Scientist Intern
Amazon Web Services (US), 2022

Research Scientist
Aindo (IT), 2020 - 2021

Visiting Research Assistant
ILC-CNR ItaliaNLP Lab (IT), 2019

🗞️ News

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation is accepted to EMNLP 2024, and Multi-property Steering of Large Language Models with Dynamic Activation Composition is accepted to BlackboxNLP 2024! See you in Miami! 🌴
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses is accepted to CLiC-it 2024! See you in Pisa! 🎉
PECoRe was accepted to ICLR 2024, and I presented it in Vienna! 🎉 I also co-organized the first Mechanistic Interpretability social at ICLR together with Nikhil Prakash, and we had more than 100 attendees!
I was awarded two research grants from the Imminent Research Center and the Amsterdam eScience Center to fund the development of the Inseq library and my future research on machine translation.

Selected Publications

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

We evaluate the rebus-solving capabilities of large language models on a new Italian dataset.

Published in: arXiv

Gabriele Sarti, Tommaso Caselli, Arianna Bisazza, Malvina Nissim

PDF Code Dataset ArXiv Models Demo

Multi-property Steering of Large Language Models with Dynamic Activation Composition

We propose Dynamic Activation Composition, an adaptive approach for multi-property activation steering of LLMs.

Published in: arXiv

Daniel Scalena, Gabriele Sarti, Malvina Nissim

PDF ArXiv Repository

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

MIRAGE uses model internals for faithful answer attribution in retrieval-augmented generation applications.

Published in: arXiv (* Equal contribution)

Jirui Qi*, Gabriele Sarti*, Raquel Fernández, Arianna Bisazza

PDF Project ArXiv Demo Repository

A Primer on the Inner Workings of Transformer-based Language Models

This primer provides a concise technical introduction to the current techniques used to interpret the inner workings of …

Published in: arXiv

Javier Ferrando, Gabriele Sarti, Arianna Bisazza, Marta Costa-jussà

PDF ArXiv

Quantifying the Plausibility of Context Reliance in Neural Machine Translation

We introduce PECoRe, an interpretability framework for identifying context dependence in language model generations.

Published in: ICLR 2024

Gabriele Sarti, Grzegorz Chrupała, Malvina Nissim, Arianna Bisazza

PDF Code Project ICLR Proceedings ArXiv Artifacts Demo

RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation

We introduce Retrieval and Attribute-Marking enhanced Prompting (RAMP) to perform attribute-controlled MT with multilingual LLMs.

Published in: ACL 2023

Gabriele Sarti, Phu Mon Htut, Xing Niu, Benjamin Hsu, Anna Currey, Georgiana Dinu, Maria Nadejde

PDF Proceedings ArXiv

See all publications

Blog posts

ICLR 2020 Trends: Better & Faster Transformers for Natural Language Processing

A summary of promising directions from ICLR 2020 for better and faster pretrained transformer language models.

Gabriele Sarti

May 3, 2020 14 min read

Recent & Upcoming Talks

Interpreting Context Usage in Generative Language Models with Inseq, PECoRe and MIRAGE

Jul 16, 2024 Ludwig Maximilian University of Munich, Bayern, Germany CIS LMU Seminar

Interpreting Context Usage in Generative Language Models with Inseq and PECoRe

May 20, 2024 Politecnico di Torino, Piedmont, Italy Politecnico di Torino Invited Talk

Quantifying the Plausibility of Context Reliance in Neural Machine Translation

May 17, 2024 Area Science Park, Trieste, Italy Area Science Park Seminar

See all talks

Projects

Attributing Context Usage in Language Models

Inseq: An Interpretability Toolkit for Sequence Generation Models

Contrastive Image-Text Pretraining for Italian

Covid-19 Semantic Browser

AItalo Svevo: Letters from an Artificial Intelligence

Histopathologic Cancer Detection with Neural Networks