Selected Publications | Gabriele Sarti

Steering Large Language Models for Machine Translation Personalization

We evaluate prompting and steering based methods for machine translation personalization in the literary domain.

Published in: Arxiv

Daniel Scalena*, Gabriele Sarti*, Arianna Bisazza, Elisabetta Fersini, Malvina Nissim

PDF ArXiv

Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement

We evaluate unsupervised word-level quality estimation (WQE) methods for machine translation, focusing on their robustness to human …

Published in: Arxiv

Gabriele Sarti, Vilém Zouhar, Malvina Nissim, Arianna Bisazza

PDF Code Dataset ArXiv

QE4PE: Word-level Quality Estimation for Human Post-Editing

We investigate the impact of word-level quality estimation on MT post-editing with 42 professional post-editors.

Published in: Arxiv Preprint

Gabriele Sarti, Vilém Zouhar, Grzegorz Chrupała, Ana Guerberof Arenas, Malvina Nissim, Arianna Bisazza

PDF Code Dataset Paper ArXiv GroTE Demo

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

We evaluate the rebus-solving capabilities of large language models on a new Italian dataset.

Published in: CLiC-it 2024

Gabriele Sarti, Tommaso Caselli, Arianna Bisazza, Malvina Nissim

PDF Dataset Paper CALAMITA Challenge Models & Dataset Repository Demo

Multi-property Steering of Large Language Models with Dynamic Activation Composition

We propose Dynamic Activation Composition, an adaptive approach for multi-property activation steering of LLMs

Published in: BlackboxNLP 2024

Daniel Scalena, Gabriele Sarti, Malvina Nissim

PDF ArXiv Repository

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

MIRAGE uses model internals for faithful answer attribution in retrieval-augmented generation applications.

Published in: EMNLP 2024 * Equal contribution

Jirui Qi*, Gabriele Sarti*, Raquel Fernández, Arianna Bisazza

PDF Project ArXiv Demo Repository

IT5: Text-to-text Pretraining for Italian Language Understanding and Generation

IT5s are the first encoder-decoder transformers pretrained on more than 40 billion Italian words.

Published in: LREC-COLING 2024

Gabriele Sarti, Malvina Nissim

PDF Code Dataset ArXiv Models Demo

A Primer on the Inner Workings of Transformer-based Language Models

This primer provides a concise technical introduction to the current techniques used to interpret the inner workings of …

Published in: Arxiv

Javier Ferrando, Gabriele Sarti, Arianna Bisazza, Marta Costa-jussà

PDF ArXiv

DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers

We propose DecoderLens, a method to interpret the iterative refinement of representations in encoder-decoder Transformer models.

Published in: Findings of NAACL 2024

Anna Langedijk, Hosein Mohebbi, Gabriele Sarti, Willem Zuidema, Jaap Jumelet

PDF Paper ArXiv Hugging Face

Quantifying the Plausibility of Context Reliance in Neural Machine Translation

We introduce PECoRe, an interpretability framework for identifying context dependence in language model generations.

Published in: ICLR 2024

Gabriele Sarti, Grzegorz Chrupała, Malvina Nissim, Arianna Bisazza

PDF Code Project ICLR Proceedings ArXiv Artifacts Demo

RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation

We introduce Retrieval and Attribute-Marking enhanced Prompting (RAMP) to perform attribute-controlled MT with multilingual LLMs.

Published in: ACL 2023

Gabriele Sarti, Phu Mon Htut, Xing Niu, Benjamin Hsu, Anna Currey, Georgiana Dinu, Maria Nadejde

PDF Proceedings ArXiv

Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation

We analyze input contributions of char-level MT models and show how they modulate word and character-level information.

Published in: TACL

Lukas Edman, Gabriele Sarti, Antonio Toral, Gertjan van Noord, Arianna Bisazza

PDF Journal ArXiv Hugging Face

Inseq: An Interpretability Toolkit for Sequence Generation Models

We present Inseq, a Python library to democratize access to interpretability analyses of sequence generation models.

Published in: ACL Demo 2023

Gabriele Sarti, Nils Feldhus, Ludwig Sickert, Oskar van der Wal, Malvina Nissim, Arianna Bisazza

PDF Project Proceedings ArXiv Docs Repository PyPI Twitter Discord Hugging Face Tutorial v0.6 Paper

Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties

We investigate whether and how using different architectures of probing models affects the performance of Italian transformers in …

Published in: CLiC-it 2020 & IJCoL

Alessio Miaschi, Gabriele Sarti, Dominique Brunato, Felice Dell’Orletta, Giulia Venturi

CLiC-it 2020 IJCoL 2022

DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages

DivEMT is a publicly available post-editing study of Neural Machine Translation over a typologically diverse set of target languages.

Published in: EMNLP 2022

Gabriele Sarti, Arianna Bisazza, Ana Guerberof Arenas, Antonio Toral

PDF Code Dataset Paper ArXiv Demo

Contrastive Language-Image Pre-training for the Italian Language

We present the first CLIP model for the Italian Language (CLIP-Italian), trained on more than 1.4 million image-text pairs.

Published in: CLiC-it 2023

Federico Bianchi, Giuseppe Attanasio, Raphael Pisoni, Silvia Terragni, Gabriele Sarti

PDF Project Paper ArXiv Model Code Demo

That Looks Hard: Characterizing Linguistic Complexity in Humans and Language Models

This paper investigates the relationship between two complementary perspectives in the human assessment of sentence complexity and how …

Published in: In CMCL 2021

Gabriele Sarti, Dominique Brunato, Felice Dell’Orletta

PDF Code DOI Proceedings

Teaching NLP with Bracelets and Restaurant Menus: An Interactive Workshop for Italian Students

We developed an interactive workshop designed to illustrate the NLP and computational linguistics to Italian high schoolers.

Published in: In TeachingNLP 2021

Ludovica Pannitto, Lucia Busso, Claudia Roberta Combei, Lucio Messina, Alessio Miaschi, Gabriele Sarti, Malvina Nissim

PDF Code Video DOI Proceedings Annex

UmBERTo-MTSA@ AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations

This work describes a self-supervised data augmentation approach used to improve learning models’ performances when only a …

Published in: In EVALITA 2020

Gabriele Sarti

PDF Code Video ArXiv

Interpreting Neural Language Models for Linguistic Complexity Assessment

This thesis presents a model-driven study of multiple phenomena associated with linguistic complexity, and how those get encoded by …

Published in: MSc Thesis @ UniTrieste

Gabriele Sarti

PDF Code Gitbook

ETC-NLG: End-to-end Topic-Conditioned Natural Language Generation

We present ETC-NLG, an approach leveraging topic modeling annotations to enable fully-unsupervised End-to-end Topic-Conditioned Natural …

Published in: IJCoL

Ginevra Carbone, Gabriele Sarti

PDF Code Video ArXiv Journal