Giannis Panagiotakis

Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification

Jun 08, 2022

Stratos Xenouleas, Alexia Tsoukara, Giannis Panagiotakis, Ilias Chalkidis, Ion Androutsopoulos

Abstract: We consider zero-shot cross-lingual transfer in legal topic classification using the recent MultiEURLEX dataset. Since the original dataset contains parallel documents, which is unrealistic for zero-shot cross-lingual transfer, we develop a new version of the dataset without parallel documents. We use it to show that translation-based methods vastly outperform cross-lingual fine-tuning of multilingually pre-trained models, the best previous zero-shot transfer method for MultiEURLEX. We also develop a bilingual teacher-student zero-shot transfer approach, which exploits additional unlabeled documents of the target language and performs better than a model fine-tuned directly on labeled target language documents.

* 4 pages, short paper at the 12th Hellenic Conference on Artificial Intelligence (SETN 2022)
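
The abstract mentions building a version of the dataset without parallel documents. The listing does not say how this was done, so the following is only a minimal sketch of one possible scheme, not the authors' procedure: each document ID is assigned to exactly one language, so no document appears in translation in another language's subset. The function name `make_non_parallel`, the round-robin assignment, and the placeholder IDs are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def make_non_parallel(doc_ids, languages, seed=0):
    """Assign each document ID to exactly one language.

    Hypothetical scheme for illustration only; the paper's actual
    construction may differ.
    """
    rng = random.Random(seed)
    shuffled = list(doc_ids)
    rng.shuffle(shuffled)

    split = defaultdict(list)
    # Round-robin over the shuffled IDs: the language subsets are
    # disjoint by construction and roughly equal in size.
    for i, doc_id in enumerate(shuffled):
        split[languages[i % len(languages)]].append(doc_id)
    return dict(split)


# Placeholder IDs, not real MultiEURLEX identifiers.
doc_ids = [f"doc-{i}" for i in range(12)]
split = make_non_parallel(doc_ids, ["en", "de", "el"])
```

With such a split, a classifier trained on the source-language subset never sees a translation of any target-language test document, which is the property the abstract describes as more realistic for zero-shot transfer.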
