G. (Gabriele) Sarti, MSc

Research interests

My research focuses on interpretability for NLP models, in particular for the benefit of end users and through the use of human behavioral signals. I am also passionate about social applications of machine learning, ethical AI, and open source collaboration.

My PhD project, conducted under the supervision of Arianna Bisazza and Malvina Nissim, aims to empower users of neural machine translation systems by developing interpretable and interactive tools for more efficient professional post-editing.


A Primer on the Inner Workings of Transformer-based Language Models

Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation

IT5: Text-to-text Pretraining for Italian Language Understanding and Generation

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

Contrastive Language-Image Pre-training for the Italian Language

DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers

Inseq: An Interpretability Toolkit for Sequence Generation Models

Quantifying the Plausibility of Context Reliance in Neural Machine Translation

RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation



The AI that explains images in Italian

CLIP-Italian: A new AI model to connect images and texts in Italian