Gabriele Sarti
Selected Publications
IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation
We present IT5, the first family of encoder-decoder transformer models pretrained specifically on Italian on more than 40 billion …
Gabriele Sarti, Malvina Nissim
Contrastive Language-Image Pre-training for the Italian Language
We present the first CLIP model for the Italian Language (CLIP-Italian), trained on more than 1.4 million image-text pairs.
Federico Bianchi, Giuseppe Attanasio, Raphael Pisoni, Silvia Terragni, Gabriele Sarti, Sri Lakshmi
That Looks Hard: Characterizing Linguistic Complexity in Humans and Language Models
This paper investigates the relationship between two complementary perspectives in the human assessment of sentence complexity and how …
Gabriele Sarti, Dominique Brunato, Felice Dell’Orletta
Teaching NLP with Bracelets and Restaurant Menus: An Interactive Workshop for Italian Students
We developed an interactive workshop designed to illustrate the basic principles of NLP and computational linguistics to high school …
Ludovica Pannitto, Lucia Busso, Claudia Roberta Combei, Lucio Messina, Alessio Miaschi, Gabriele Sarti, Malvina Nissim
AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations
This work describes a self-supervised data augmentation approach used to improve learning models’ performance when only a …
Gabriele Sarti
Italian Transformers Under the Linguistic Lens
We investigate whether and how using different architectures of probing models affects the performance of Italian transformers in …
Alessio Miaschi, Gabriele Sarti, Dominique Brunato, Felice Dell’Orletta, Giulia Venturi
Interpreting Neural Language Models for Linguistic Complexity Assessment
This thesis presents a model-driven study of multiple phenomena associated with linguistic complexity, and how those get encoded by …
Gabriele Sarti
ETC-NLG: End-to-end Topic-Conditioned Natural Language Generation
We present ETC-NLG, an approach leveraging topic modeling annotations to enable fully-unsupervised End-to-end Topic-Conditioned Natural …
Ginevra Carbone, Gabriele Sarti