Activities | Gabriele Sarti

Activities

A summary of my professional activities

👨‍🏫 Teaching

I'm not currently teaching any course.

Previous courses

Natural Language Processing

February, 2022 – April, 2022, with Arianna Bisazza
MSc Information Science, University of Groningen

Natural Language Processing

February, 2023 – April, 2023, with Arianna Bisazza
MSc Information Science, University of Groningen

Natural Language Processing

February, 2024 – April, 2024, with Arianna Bisazza, Jirui Qi, Leonidas Zotos
MSc Information Science, University of Groningen

Fundamentals of Machine Learning: Theory and Practice

September, 2024 – October, 2024
Data Wise: Data Science in Society Minor, University of Groningen

🎓 Supervision

You?

I am often looking for motivated students to work with.
If you are interested in working with me, please feel free to reach out! :)
Previously supervised students

Samuele D’Avenia

April, 2024 – October, 2024 MSc in Data Science, University of Trieste
MSc Thesis: Interpretability in VLMs: Extending the PECoRe Framework to Image Context

Sara Candussio

April, 2024 – September, 2024 MSc in Data Science, University of Trieste
MSc Thesis: A Dialectic Pipeline for Improving LLM Robustness

Konstantin Chernyshev

April, 2023 – October, 2024 MSc in LCT, RUG & Saarland University
MSc Project: Improving the Identification and Categorization of Word-level QE for MT
MSc Thesis: Fast and Efficient Structured Pruning of LLMs with Gradient-Based Meta-Mask

Daniel Scalena

April, 2023 – October, 2023 MSc in CS, University Milano-Bicocca

Qiankun Zheng

October, 2022 – April, 2023 MSc in LCT, RUG & Saarland University
MSc Project: Cross-lingual Analysis of Neural Machine Translation Post-Edits

Ludwig Sickert

April, 2022 – July, 2023 MSc in AI, RUG
MSc Thesis: Assessing Formality in Machine Translation through Interpretability Methods

🀝 Academic Service

Reviewing

CLiC-it 2021, LREC 2022, EMNLP 2022, ACL 2023, EMNLP 2023, BlackboxNLP 2023, NAACL 2024, COLM 2024, MechInterp WS 2024, EMNLP 2024, BlackboxNLP 2024, CLiC-it 2024, ICLR 2025

🗓️ Attended Events

CLiC-it 2024

December, 2024 Pisa, Italy
Poster + 2 Research Communications + CALAMITA Task

EMNLP 2024

November, 2024 Miami, FL, USA
Main + BlackboxNLP WS

XAI 2024

July, 2024 Valletta, Malta

LREC-COLING 2024

May, 2024 Turin, Italy
Main conference short paper

ICLR 2024

May, 2024 Vienna, Austria
Main conference paper

EMNLP 2023

December, 2023 Singapore
Two long abstracts at the BlackboxNLP workshop

ACL 2023

July, 2023 Toronto, ON, Canada

REST-CL 2023

June, 2023 Sant Jaume dels Domenys, Spain
Hosted a tutorial on interpretability for generative language models

Lectures on Computational Linguistics 2023

May, 2023 Pisa, Italy
Hosted a lab session on interpretability for neural language models

EMNLP 2022

December, 2022 Abu Dhabi, UAE

EAMT 2022

June, 2022 Ghent, Belgium

InDeep Launch Event

May, 2022 Amsterdam, Netherlands

NITS 2022

May, 2022 Groningen, Netherlands
Presented my PhD project poster

XAI4Debugging WS @ NeurIPS 2021

January, 2022 Online

ALPS Winter School 2022

January, 2022 Online

EMNLP 2021

November, 2021 Online

NAACL 2021

June, 2021 Online

CLiC-it 2020

March, 2021 Online

EVALITA 2020

December, 2020 Online
Special student paper mention for UmBERTo-MTSA

NL4AI WS @ AIxIA 2020

December, 2020 Online

EMNLP & CoNLL 2020

November, 2020 Online

ACL 2020

July, 2020 Online

ICLR 2020

April, 2020 Online

CLiC-it 2019

November, 2019 Bari, Italy

ACL 2019

July, 2019 Florence, Italy

🎉 Other Happenings

2023

May 2023: I was awarded two research grants from the Imminent Research Center and the Amsterdam eScience Center to fund the development of the Inseq library and my future research on machine translation.

February 2023: Inseq, our open-source toolkit for post-hoc interpretability of generative language models, is now available on GitHub! 🛠 We also have a demo paper with some usage examples.


2022

July to Sep, 2022: Spent the summer at Amazon AWS in New York as an Applied Scientist Intern on the Amazon Translate team, working with Georgiana Dinu, Maria Nădejde, Xing Niu and Benjamin Hsu.

May 23-30, 2022: Hosted Michael Carl at the University of Groningen, with talks about translation processes and interpretability for machine translation.


2021

Sep 1, 2021: Starting a PhD in interpretable neural machine translation at the wonderful GroNLP group under the supervision of Arianna Bisazza, Malvina Nissim and Grzegorz Chrupała, as part of the NWO-funded consortium InDeep: Interpreting Deep Learning Models for Text and Sound.

Jul 31, 2021: Last day of work at Aindo. In the last months I worked on structured prediction from clinical reports, few-shot cross-lingual transfer for neural QA models, GNN for molecular properties prediction, neural recommender systems and sketch-to-image generation with GANs.

Jul 23, 2021: Participated in the HuggingFace JAX/Flax Community Week, building the first CLIP image-text model for Italian (demo here) and the first T5 seq2seq model pre-trained on Italian. I also cleaned the largest available Italian corpus to date, the Italian split of mC4 (full size after cleaning ~103M docs, 41B words, 215GB), making it available on the HuggingFace Hub.

Jan 1, 2021: Started working full-time as ML research scientist for Aindo.


2020

Dec 11, 2020: Graduated cum laude from the Data Science MSc at the University of Trieste and SISSA with my thesis Interpreting Neural Language Models for Linguistic Complexity Assessment, under the supervision of Felice Dell’Orletta and Davide Crepaldi.

Oct 19, 2020: Developed and open-sourced LambdaBert, a BERT implementation based on lambda layers instead of self-attention. Viral on Made with ML.

Sep 28, 2020: Attended the “Deep Learning Theories in Neuroscience” session at the Oxford Virtual Autumn School in Neuroscience.

May 8, 2020: Organizing the AI, Stats & COVID-19 event with AI2S, with more than 100 participants from around the world.

Mar to Jun, 2020: Main developer for the Covid-19 Semantic Browser, a joint project by AILC and Area Science Park, covered in the news by many sources, including the NLP Newsletter by Sebastian Ruder and the Anthony Goldbloom talk at the Stanford HAI conference on COVID-19 and AI.

Mar 22, 2020: Officially founded AI2S, the first AI student society in Friuli-Venezia Giulia.

Feb 13, 2020: Presented the “Hey Siri, what’s computational linguistics” interactive talk at SISSA Student Day 2020.

Feb 10-14, 2020: Attended the course “Eye-tracking Methods for Cognitive Science” by Elizabeth Schotter at SISSA.


2019

Sep to Dec, 2019: Internship at the ItaliaNLP Lab of the Institute for Computational Linguistics ILC-CNR in Pisa. Worked under the supervision of Felice Dell’Orletta on linguistic complexity assessment with neural language models.