Activities | Gabriele Sarti


A summary of my professional activities

👨‍🏫 Teaching

Natural Language Processing

February, 2022 – April, 2022, with Arianna Bisazza
MSc Information Science, University of Groningen

Natural Language Processing

February, 2023 – April, 2023, with Arianna Bisazza
MSc Information Science, University of Groningen

🎓 Supervision

Qiankun Zheng

October, 2022 – Ongoing MSc in LCT, RUG & Saarland University
Project: TBD

Ludwig Sickert

April, 2022 – Ongoing MSc in AI, RUG
Project: TBD

🤝 Academic Service


CLiC-it 2021, LREC 2022, EMNLP 2022, ACL 2023

🗓️ Attended Events


EMNLP 2022

December, 2022 Abu Dhabi, UAE

EAMT 2022

June, 2022 Ghent, Belgium

InDeep Launch Event

May, 2022 Amsterdam, Netherlands

NITS 2022

May, 2022 Groningen, Netherlands
Presented my PhD project poster

ALPS Winter School 2022

January, 2022 Online

XAI4Debugging WS @ NeurIPS 2021

January, 2022 Online

EMNLP 2021

November, 2021 Online

NAACL 2021

June, 2021 Online

CLiC-it 2020

March, 2021 Online


December, 2020 Online
Special student paper mention for UmBERTo-MTSA

NL4AI WS @ AIxIA 2020

December, 2020 Online

EMNLP & CoNLL 2020

November, 2020 Online

ACL 2020

July, 2020 Online

ICLR 2020

April, 2020 Online

CLiC-it 2019

November, 2019 Bari, Italy

ACL 2019

July, 2019 Florence, Italy

🎉 Other Happenings


July to Sep, 2022: Spent the summer at Amazon AWS in New York as an Applied Scientist Intern with the Amazon Translate team, working with Georgiana Dinu, Maria Nădejde, Xing Niu and Benjamin Hsu.

May 23-30, 2022: Hosted Michael Carl at the University of Groningen, with talks about translation processes and interpretability for machine translation.


Sep 1, 2021: Started a PhD in interpretable neural machine translation at the wonderful GroNLP group under the supervision of Arianna Bisazza, Malvina Nissim and Grzegorz Chrupała, as part of the NWO-funded consortium InDeep: Interpreting Deep Learning Models for Text and Sound.

Jul 31, 2021: Last day of work at Aindo. In my final months there I worked on structured prediction from clinical reports, few-shot cross-lingual transfer for neural QA models, GNNs for molecular property prediction, neural recommender systems and sketch-to-image generation with GANs.

Jul 23, 2021: Participated in the HuggingFace JAX/Flax Community Week, building the first CLIP image-text model for Italian and the first T5 seq2seq model pre-trained on Italian. I also cleaned the largest available Italian corpus to date, the Italian split of mC4 (full size after cleaning ~103M docs, 41B words, 215 GB), and made it available on the HuggingFace Hub.

Jan 1, 2021: Started working full-time as an ML research scientist for Aindo.


Dec 11, 2020: Graduated cum laude from the Data Science MSc at the University of Trieste and SISSA with my thesis Interpreting Neural Language Models for Linguistic Complexity Assessment, under the supervision of Felice Dell’Orletta and Davide Crepaldi.

Oct 19, 2020: Developed and open-sourced LambdaBert, a BERT implementation based on lambda layers instead of self-attention. The project went viral on Made with ML.

Sep 28, 2020: Attended the “Deep Learning Theories in Neuroscience” session at the Oxford Virtual Autumn School in Neuroscience.

May 8, 2020: Organized the AI, Stats & COVID-19 event with AI2S, with more than 100 participants from around the world.

Mar to Jun, 2020: Main developer for the Covid-19 Semantic Browser, a joint project by AILC and Area Science Park, covered by many sources, including the NLP Newsletter by Sebastian Ruder and Anthony Goldbloom's talk at the Stanford HAI conference on COVID-19 and AI.

Mar 22, 2020: Officially founded AI2S, the first AI student society in Friuli-Venezia Giulia.

Feb 13, 2020: Presented the “Hey Siri, what’s computational linguistics” interactive talk at SISSA Student Day 2020.

Feb 10-14, 2020: Attended the course “Eye-tracking Methods for Cognitive Science” by Elizabeth Schotter at SISSA.


Sep to Dec, 2019: Internship at the ItaliaNLP Lab of the Institute for Computational Linguistics ILC-CNR in Pisa. Worked under the supervision of Felice Dell’Orletta on linguistic complexity assessment with neural language models.