Souhail Bakkali

DocSum: Domain-Adaptive Pre-training for Document Abstractive Summarization

Dec 11, 2024
Via arXiv

KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark

Oct 23, 2024
Via arXiv

Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting

May 21, 2024
Via arXiv

IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering

Mar 01, 2024
Via arXiv

TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language

Sep 11, 2023
Via arXiv

EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification

May 11, 2023
Via arXiv

VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification

May 24, 2022
Via arXiv

Face Detection in Camera Captured Images of Identity Documents under Challenging Conditions

Nov 08, 2019
Via arXiv