Publications

(2024). SPIRIT-LM: Interleaved Spoken and Written Language Model.

(2023). The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages. WMT.

(2023). Evaluating and Modeling Attribution for Cross-Lingual Question Answering. EMNLP.

(2022). Cross-Lingual GENQA: A Language-Agnostic Generative Question Answering Approach for Open-Domain Question Answering. AACL.

(2021). When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models. NAACL.

(2021). First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT. EACL.

(2020). CamemBERT: a Tasty French Language Model. ACL.

(2019). Enhancing BERT for Lexical Normalization. WNUT.
