Selected Publications | Gabriele Sarti
Home
About me
Publications
Blog
Talks
Projects
Activities
CV
Communities
AI2S
AISIG
Selected Publications
Type
Conference paper
Journal article
Preprint
Thesis
Date
2023
2022
2021
2020
0001
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
We propose DecoderLens, a method to evaluate the iterative refinement of representations in encoder-decoder Transformer models.
Published in: Arxiv
Anna Langedijk
,
Hosein Mohebbi
,
Gabriele Sarti
,
Willem Zuidema
,
Jaap Jumelet
PDF
Cite
ArXiv
Hugging Face
Quantifying the Plausibility of Context Reliance in Neural Machine Translation
We introduce PECoRe, an interpretability framework for identifying context dependence in language model generations.
Published in: Arxiv
Gabriele Sarti
,
Grzegorz Chrupała
,
Malvina Nissim
,
Arianna Bisazza
PDF
Cite
ArXiv
Hugging Face
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation
We introduce Retrieval and Attribute-Marking enhanced Prompting (RAMP) to perform attribute-controlled MT with multilingual LLMs.
Published in: ACL 2023
Gabriele Sarti
,
Phu Mon Htut
,
Xing Niu
,
Benjamin Hsu
,
Anna Currey
,
Georgiana Dinu
,
Maria Nadejde
PDF
Cite
Paper
ArXiv
Are Character-level Translations Worth the Wait? Comparing Character- and Subword-level Models for Machine Translation
We analyze input contributions of char-level MT models and show how they modulate word and character-level information.
Published in: Arxiv
Lukas Edman
,
Gabriele Sarti
,
Antonio Toral
,
Gertjan van Noord
,
Arianna Bisazza
PDF
Cite
ArXiv
Hugging Face
Inseq: An Interpretability Toolkit for Sequence Generation Models
We present Inseq, a Python library to democratize access to interpretability analyses of sequence generation models.
Published in: ACL Demo 2023
Gabriele Sarti
,
Nils Feldhus
,
Ludwig Sickert
,
Oskar van der Wal
,
Malvina Nissim
,
Arianna Bisazza
PDF
Cite
Project
Paper
ArXiv
Docs
Repository
PyPI
Twitter
Discord
Hugging Face
Tutorial
DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages
DivEMT is a publicly available post-editing study of Neural Machine Translation over a typologically diverse set of target languages.
Published in: EMNLP 2022
Gabriele Sarti
,
Arianna Bisazza
,
Ana Guerberof Arenas
,
Antonio Toral
PDF
Cite
ArXiv
Dataset
Code
Demo
IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation
IT5s are the first encoder-decoder transformers pretrained on more than 40 billion Italian words.
Published in: Arxiv
Gabriele Sarti
,
Malvina Nissim
PDF
Cite
ArXiv
Models
Code
Demo
Contrastive Language-Image Pre-training for the Italian Language
We present the first CLIP model for the Italian Language (CLIP-Italian), trained on more than 1.4 million image-text pairs.
Published in: Arxiv
Federico Bianchi
,
Giuseppe Attanasio
,
Raphael Pisoni
,
Silvia Terragni
,
Gabriele Sarti
,
Sri Lakshmi
PDF
Cite
Project
ArXiv
Model
Code
Demo
That Looks Hard: Characterizing Linguistic Complexity in Humans and Language Models
This paper investigates the relationship between two complementary perspectives in the human assessment of sentence complexity and how …
Published in: In CMCL 2021
Gabriele Sarti
,
Dominique Brunato
,
Felice Dell’Orletta
PDF
Cite
Code
DOI
Teaching NLP with Bracelets and Restaurant Menus: An Interactive Workshop for Italian Students
We developed an interactive workshop designed to illustrate the NLP and computational linguistics to Italian high schoolers.
Published in: In TeachingNLP 2021
Ludovica Pannitto
,
Lucia Busso
,
Claudia Roberta Combei
,
Lucio Messina
,
Alessio Miaschi
,
Gabriele Sarti
,
Malvina Nissim
PDF
Cite
Code
Video
DOI
Annex
UmBERTo-MTSA@ AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations
This work describes a self-supervised data augmentation approach used to improve learning models’ performances when only a …
Published in: In EVALITA 2020
Gabriele Sarti
PDF
Cite
Code
Video
ArXiv
Interpreting Neural Language Models for Linguistic Complexity Assessment
This thesis presents a model-driven study of multiple phenomena associated with linguistic complexity, and how those get encoded by …
Published in: MSc Thesis @ UniTrieste
Gabriele Sarti
PDF
Cite
Code
Gitbook
ETC-NLG: End-to-end Topic-Conditioned Natural Language Generation
We present ETC-NLG, an approach leveraging topic modeling annotations to enable fully-unsupervised End-to-end Topic-Conditioned Natural …
Published in: IJCoL
Ginevra Carbone
,
Gabriele Sarti
PDF
Cite
Code
Video
ArXiv
IJCoL
Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties
We investigate whether and how using different architectures of probing models affects the performance of Italian transformers in …
Published in: CLiC-it 2020 & IJCoL
Alessio Miaschi
,
Gabriele Sarti
,
Dominique Brunato
,
Felice Dell’Orletta
,
Giulia Venturi
Cite
CLiC-it 2020
IJCoL 2022
Cite
×