Multilingual Text Classification


Multilingual text classification is the process of categorizing text documents in multiple languages into predefined categories.

Papers and Code

A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models

Jun 29, 2024
Viaarxiv icon

Untangling the Unrestricted Web: Automatic Identification of Multilingual Registers

Jun 28, 2024
Viaarxiv icon

Enhancing Multilingual Voice Toxicity Detection with Speech-Text Alignment

Add code
Jun 14, 2024
Viaarxiv icon

Universal Cross-Lingual Text Classification

Add code
Jun 16, 2024
Viaarxiv icon

Multimodal Metadata Assignment for Cultural Heritage Artifacts

Jun 01, 2024
Viaarxiv icon

Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection

Add code
May 28, 2024
Viaarxiv icon

XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples

Add code
May 08, 2024
Viaarxiv icon

Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents

Add code
May 13, 2024
Figure 1 for Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents
Figure 2 for Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents
Figure 3 for Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents
Figure 4 for Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents
Viaarxiv icon

Using Machine Translation to Augment Multilingual Classification

May 09, 2024
Viaarxiv icon

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection

Apr 22, 2024
Figure 1 for SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
Figure 2 for SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
Figure 3 for SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
Figure 4 for SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
Viaarxiv icon