Welcome to my website! 👋 I am a postdoc at the BauLab at Northeastern University, working on interpretability interfaces and white-box methods for the evaluations ecosystem as part of the National Deep Inference Fabric (NDIF).
Previously, I was a PhD student at the University of Groningen, where I completed my thesis on actionable interpretability for machine translation as a member of the InCLoW team, the GroNLP group and the Dutch InDeep consortium. Before that, I was a applied scientist intern at Amazon Translate NYC and a research scientist at Aindo.
My current research interests include LLM reasoning, interpretability, user modeling and monitoring of agentic systems. I’m especially interested in making white-box auditing a practical part of how we evaluate frontier AI, since behavioural tests fail to surface unverbalized behaviors, and are increasingly inadequate as models get more capable. I work on ways to surface and steer the beliefs, goals, and plans behind what an agent does, and on the open infrastructure that links interpretability tools to the evaluation ecosystem. If you’re excited about these topics, shoot me a message!
Your (anonymous) constructive feedback is always welcome! 🙂
PhD in Natural Language Processing
University of Groningen (NL), 2021 - 2025
MSc. in Data Science and Scientific Computing
University of Trieste & SISSA (IT), 2018 - 2020
Postdoctoral Researcher
Northeastern University (US), 2026 -
Applied Scientist Intern
Amazon Web Services (US), 2022
Research Scientist
Aindo (IT), 2020 - 2021
I am co-organizing the BlackboxNLP Workshop at EMNLP 2026! Participate in our reproducibility challenge! 🔍
I worked with talented mentees as part of the Spring'26 SPAR Program on the project “Monitoring and Attributing Implicit Personalization in Conversational Agents, and will be a mentor for the CBAI Summer Fellowship as well! 🌱
My PhD thesis on Actionable Interpretability for Machine Translation was awarded the 2025 Best Dissertation Award by the European Association for Machine Translation! Excited to present my work in Tilburg in June 🏆
I started a postdoctoral position at Northeastern University as a member of the BauLab. Very excited to work with the NDIF team on building cutting-edge tools for interpretability research! 🔍
I graduated cum laude from my PhD in Natural Language Processing at the University of Groningen! 🎓 My thesis on Actionable Interpretability for Machine Translation is now available online. Huge thanks to my supervisors Arianna Bisazza, Malvina Nissim and Grzegorz Chrupała for their support during these years.
PhD Thesis at the University of Groningen
This dissertation aims to bridge the gap between method-centric interpretability research and outcome-centric real-world machine translation applications. We develop novel methods to understand and control language model generation, then study how to integrate these advances effectively into human translation workflows. Our research spans three interconnected macro-themes: understanding how language models exploit contextual information during generation, controlling model generation for personalized translation outputs, and integrating interpretability insights into human translation workflows.
An interpretability framework to detect and attribute context usage in language models’ generations
An open-source library to democratize access to model interpretability for sequence generation models
The first CLIP model pretrained on the Italian language.
A semantic browser for SARS-CoV-2 and COVID-19 powered by neural language models.
Generating letters with a neural language model in the style of Italo Svevo, a famous italian writer of the 20th century.