Selected Publications | Gabriele Sarti
Selected Publications
DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages
We introduce DivEMT, the first publicly available post-editing study of Neural Machine Translation over a typologically diverse set of …
Published in: EMNLP 2022
Gabriele Sarti, Arianna Bisazza, Ana Guerberof Arenas, Antonio Toral
Resources: PDF, arXiv, Dataset, Code, Demo
IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation
We present IT5, the first family of encoder-decoder transformer models pretrained specifically on Italian on more than 40 billion …
Published in: arXiv
Gabriele Sarti, Malvina Nissim
Resources: PDF, arXiv, Models, Code, Demo
Contrastive Language-Image Pre-training for the Italian Language
We present the first CLIP model for the Italian Language (CLIP-Italian), trained on more than 1.4 million image-text pairs.
Published in: arXiv
Federico Bianchi, Giuseppe Attanasio, Raphael Pisoni, Silvia Terragni, Gabriele Sarti, Sri Lakshmi
Resources: PDF, Project, arXiv, Model, Code, Demo
That Looks Hard: Characterizing Linguistic Complexity in Humans and Language Models
This paper investigates the relationship between two complementary perspectives in the human assessment of sentence complexity and how …
Published in: CMCL 2021
Gabriele Sarti, Dominique Brunato, Felice Dell’Orletta
Resources: PDF, Code, DOI
Teaching NLP with Bracelets and Restaurant Menus: An Interactive Workshop for Italian Students
We developed an interactive workshop designed to illustrate the basic principles of NLP and computational linguistics to high school …
Published in: TeachingNLP 2021
Ludovica Pannitto, Lucia Busso, Claudia Roberta Combei, Lucio Messina, Alessio Miaschi, Gabriele Sarti, Malvina Nissim
Resources: PDF, Code, Video, DOI, Annex
AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations
This work describes a self-supervised data augmentation approach used to improve learning models’ performances when only a …
Published in: EVALITA 2020
Gabriele Sarti
Resources: PDF, Code, Video, arXiv
Interpreting Neural Language Models for Linguistic Complexity Assessment
This thesis presents a model-driven study of multiple phenomena associated with linguistic complexity, and how those get encoded by …
Published in: MSc Thesis @ UniTrieste
Gabriele Sarti
Resources: PDF, Code, Gitbook
ETC-NLG: End-to-end Topic-Conditioned Natural Language Generation
We present ETC-NLG, an approach leveraging topic modeling annotations to enable fully-unsupervised End-to-end Topic-Conditioned Natural …
Published in: IJCoL
Ginevra Carbone, Gabriele Sarti
Resources: PDF, Code, Video, arXiv, IJCoL
Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties
We investigate whether and how using different architectures of probing models affects the performance of Italian transformers in …
Published in: CLiC-it 2020 & IJCoL
Alessio Miaschi, Gabriele Sarti, Dominique Brunato, Felice Dell’Orletta, Giulia Venturi
Resources: CLiC-it 2020, IJCoL 2022