Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stratos Xenouleas

Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification

Jun 08, 2022

Stratos Xenouleas, Alexia Tsoukara, Giannis Panagiotakis, Ilias Chalkidis, Ion Androutsopoulos

Figure 1 for Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification

Figure 2 for Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification

Figure 3 for Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification

Figure 4 for Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification

Abstract:We consider zero-shot cross-lingual transfer in legal topic classification using the recent MultiEURLEX dataset. Since the original dataset contains parallel documents, which is unrealistic for zero-shot cross-lingual transfer, we develop a new version of the dataset without parallel documents. We use it to show that translation-based methods vastly outperform cross-lingual fine-tuning of multilingually pre-trained models, the best previous zero-shot transfer method for MultiEURLEX. We also develop a bilingual teacher-student zero-shot transfer approach, which exploits additional unlabeled documents of the target language and performs better than a model fine-tuned directly on labeled target language documents.

* 4 pages, short paper at the 12th Hellenic Conference on Artificial Intelligence (SETN 2022)

Via

Access Paper or Ask Questions

SumQE: a BERT-based Summary Quality Estimation Model

Sep 02, 2019

Stratos Xenouleas, Prodromos Malakasiotis, Marianna Apidianaki, Ion Androutsopoulos

Figure 1 for SumQE: a BERT-based Summary Quality Estimation Model

Figure 2 for SumQE: a BERT-based Summary Quality Estimation Model

Figure 3 for SumQE: a BERT-based Summary Quality Estimation Model

Figure 4 for SumQE: a BERT-based Summary Quality Estimation Model

Abstract:We propose SumQE, a novel Quality Estimation model for summarization based on BERT. The model addresses linguistic quality aspects that are only indirectly captured by content-based approaches to summary evaluation, without involving comparison with human references. SumQE achieves very high correlations with human ratings, outperforming simpler models addressing these linguistic aspects. Predictions of the SumQE model can be used for system development, and to inform users of the quality of automatically produced summaries and other types of generated text.

* In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019), Hong Kong, China, 2019

Via

Access Paper or Ask Questions