First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT

Publication
EACL