G. (Gabriele) Sarti, MSc

PhD Student in Computational Linguistics

Faculty of Arts

E-mail:

g.sarti rug.nl

Research interests

I am a PhD student in the Computational Linguistics Group of the University of Groningen and a member of the InDeep consortium, working on user-centric interpretability for neural machine translation. I am also the main developer of the Inseq library. My supervisors are Arianna Bisazza, Malvina Nissim and Grzegorz Chrupała. My research aims to empower users of neural machine translation systems by developing interpretable and interactive tools for more efficient professional post-editing.

My research interests center on interpretability for NLP models, in particular methods that benefit end users and leverage human behavioral signals. I am also passionate about language games, uncertainty estimation and open-source collaboration.

Publications

A Primer on the Inner Workings of Transformer-based Language Models

Ferrando, J., Sarti, G., Bisazza, A. & Costa-jussà, M., 30-Apr-2024.

Research output: Working paper › Preprint › Academic

Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation

Edman, L., Sarti, G., Toral, A., van Noord, G. & Bisazza, A., 16-Apr-2024, In: Transactions of the Association for Computational Linguistics. 12, p. 392-410 19 p.

Research output: Contribution to journal › Article › Academic › peer-review

DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers

Langedijk, A., Mohebbi, H., Sarti, G., Zuidema, W. & Jumelet, J., Jun-2024, Findings of the Association for Computational Linguistics: NAACL 2024 - Findings. Duh, K., Gomez, H. & Bethard, S. (eds.). Association for Computational Linguistics, ACL Anthology, p. 4764-4780 17 p. (Findings of the Association for Computational Linguistics: NAACL 2024 - Findings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Democratizing Advanced Attribution Analyses of Generative Language Models with the Inseq Toolkit

Sarti, G., Feldhus, N., Qi, J., Nissim, M. & Bisazza, A., 2024, xAI-2024 Late-breaking Work, Demos and Doctoral Consortium Joint Proceedings. Longo, L., Liu, W. & Montavon, G. (eds.). CEUR Workshop Proceedings (CEUR-WS.org), p. 289-296 8 p. (CEUR Workshop Proceedings; vol. 3793).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

IT5: Text-to-text Pretraining for Italian Language Understanding and Generation

Sarti, G. & Nissim, M., 2024, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Calzolari, N., Kan, M.-Y., Hoste, V., Lenci, A., Sakti, S. & Xue, N. (eds.). European Language Resources Association (ELRA), p. 9422-9433 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

Qi, J., Sarti, G., Fernández, R. & Bisazza, A., 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Al-Onaizan, Y., Bansal, M. & Chen, Y.-N. (eds.). Association for Computational Linguistics (ACL), p. 6037-6053 17 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Multi-property Steering of Large Language Models with Dynamic Activation Composition

Scalena, D., Sarti, G. & Nissim, M., Nov-2024, Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP. Belinkov, Y., Kim, N., Jumelet, J., Mohebbi, H., Mueller, A. & Chen, H. (eds.). Association for Computational Linguistics (ACL), p. 577–603 27 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Non Verbis, Sed Rebus: Large Language Models Are Weak Solvers of Italian Rebuses

Sarti, G., Caselli, T., Nissim, M. & Bisazza, A., Dec-2024, Proceedings of the 10th Italian Conference on Computational Linguistics. Dell'Orletta, F., Lenci, A., Montemagni, S. & Sprugnoli, R. (eds.). CEUR Workshop Proceedings (CEUR-WS.org), (CEUR Workshop Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Contrastive Language-Image Pre-training for the Italian Language

Bianchi, F., Attanasio, G., Pisoni, R., Terragni, S., Sarti, G. & Balestri, D., 2023, Proceedings of the 9th Italian Conference on Computational Linguistics. Boschetti, F., Lebani, G. E., Magnini, B. & Novielli, N. (eds.). CEUR Workshop Proceedings (CEUR-WS.org), 8 p. (CEUR Workshop Proceedings; vol. 3596).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers

Langedijk, A., Mohebbi, H., Sarti, G., Zuidema, W. & Jumelet, J., 5-Oct-2023, (Submitted) arXiv.

Research output: Working paper › Preprint › Academic

Press/media

Can Word-level Quality Estimation Inform and Improve Machine Translation Post-editing?

Sarti, G. & Bisazza, A.

10/12/2024

Press/Media: Research › Professional

Intelligenza Artificiale: Open letter from the scientific community for the regulation of generative models in the AI Act

Sarti, G.

30/11/2023

Press/Media: Expert Comment › Popular

The AI that explains images in Italian

Bianchi, F., Attanasio, G., Pisoni, R., Terragni, S., Sarti, G. & Lakshimi, S.

13/09/2021

Press/Media: Research › Academic

CLIP-Italian: A new AI model to connect images and texts in Italian

Bianchi, F., Attanasio, G., Pisoni, R., Terragni, S., Sarti, G. & Lakshimi, S.

26/08/2021

Press/Media: Research › Academic
