Welcome to my website! 👋 I am a PhD student at the Computational Linguistics Group of the University of Groningen. I am part of the NWO-funded project InDeep: Interpreting Deep Learning Models for Text and Sound, focusing on interpretability for neural machine translation. I am supervised by Arianna Bisazza, Malvina Nissim and Grzegorz Chrupała.
Previously, I was a research scientist at Aindo, a student in the Data Science MSc at the University of Trieste & SISSA, and a founding member of the AI Student Society. My master’s thesis with the ItaliaNLP Lab in Pisa studied linguistic complexity using gaze recordings and neural language models.
My research focuses on interpretability for NLP models, in particular on making it useful to end users and on leveraging human behavioral signals. I am also passionate about social applications of machine learning, ethical AI, and open source collaboration.
PhD in Natural Language Processing
University of Groningen (NL), 2021 - Ongoing
MSc. in Data Science and Scientific Computing
University of Trieste & SISSA (IT), 2018 - 2020
DEC in Software Management
Cégep de Saint-Hyacinthe (CA), 2015 - 2018
Applied Scientist Intern
Amazon Web Services (US), 2022
Research Scientist
Aindo (IT), 2020 - 2021
Visiting Research Assistant
ILC-CNR ItaliaNLP Lab (IT), 2019
I will spend the summer at Amazon Web Services in New York working as an Applied Scientist Intern with the Amazon Translate team, under the supervision of Georgiana Dinu.
Our paper DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages is out on arXiv. Code, 🤗 Dataset and demo are also available.
My paper IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation is out on arXiv, alongside all data, checkpoints and code used for the experiments.
Starting a PhD on interpretability for NMT at the University of Groningen in September 2021. I will be part of the NWO InDeep network, working with Arianna Bisazza, Malvina Nissim and Grzegorz Chrupała.
The first CLIP model pretrained on the Italian language.
A semantic browser for SARS-CoV-2 and COVID-19 powered by neural language models.
Generating letters with a neural language model in the style of Italo Svevo, a famous Italian writer of the 20th century.
A journey into the state of the art of histopathologic cancer detection approaches.