Cross Lingual Document Classification


Towards Scalable and Cross-Lingual Specialist Language Models for Oncology

Mar 11, 2025
Via arXiv

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

Jun 13, 2024
Via arXiv

What Drives Performance in Multilingual Language Models?

Apr 29, 2024
Via arXiv

A Multi-Modal Multilingual Benchmark for Document Image Classification

Oct 25, 2023
Via arXiv

L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages

Jan 04, 2024
Via arXiv

AMuRD: Annotated Multilingual Receipts Dataset for Cross-lingual Key Information Extraction and Classification

Sep 18, 2023
Via arXiv

A General-Purpose Multilingual Document Encoder

May 11, 2023
Via arXiv

Multimodal Document Analytics for Banking Process Automation

Jul 21, 2023
Via arXiv

Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports

Sep 14, 2023
Via arXiv

Are the Best Multilingual Document Embeddings simply Based on Sentence Embeddings?

Apr 28, 2023
Via arXiv