Picture for Oren Nuriel

Oren Nuriel

TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models

Add code
Nov 07, 2024
Figure 1 for TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Figure 2 for TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Figure 3 for TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Figure 4 for TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Viaarxiv icon

Enhancing Vision-Language Pre-training with Rich Supervisions

Add code
Mar 05, 2024
Figure 1 for Enhancing Vision-Language Pre-training with Rich Supervisions
Figure 2 for Enhancing Vision-Language Pre-training with Rich Supervisions
Figure 3 for Enhancing Vision-Language Pre-training with Rich Supervisions
Figure 4 for Enhancing Vision-Language Pre-training with Rich Supervisions
Viaarxiv icon

Question Aware Vision Transformer for Multimodal Reasoning

Add code
Feb 08, 2024
Figure 1 for Question Aware Vision Transformer for Multimodal Reasoning
Figure 2 for Question Aware Vision Transformer for Multimodal Reasoning
Figure 3 for Question Aware Vision Transformer for Multimodal Reasoning
Figure 4 for Question Aware Vision Transformer for Multimodal Reasoning
Viaarxiv icon

Towards Models that Can See and Read

Add code
Jan 18, 2023
Figure 1 for Towards Models that Can See and Read
Figure 2 for Towards Models that Can See and Read
Figure 3 for Towards Models that Can See and Read
Figure 4 for Towards Models that Can See and Read
Viaarxiv icon

CLIPTER: Looking at the Bigger Picture in Scene Text Recognition

Add code
Jan 18, 2023
Figure 1 for CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Figure 2 for CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Figure 3 for CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Figure 4 for CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Viaarxiv icon

Out-of-Vocabulary Challenge Report

Add code
Sep 14, 2022
Figure 1 for Out-of-Vocabulary Challenge Report
Figure 2 for Out-of-Vocabulary Challenge Report
Figure 3 for Out-of-Vocabulary Challenge Report
Figure 4 for Out-of-Vocabulary Challenge Report
Viaarxiv icon

TextAdaIN: Fine-Grained AdaIN for Robust Text Recognition

Add code
May 09, 2021
Figure 1 for TextAdaIN: Fine-Grained AdaIN for Robust Text Recognition
Figure 2 for TextAdaIN: Fine-Grained AdaIN for Robust Text Recognition
Figure 3 for TextAdaIN: Fine-Grained AdaIN for Robust Text Recognition
Figure 4 for TextAdaIN: Fine-Grained AdaIN for Robust Text Recognition
Viaarxiv icon

Permuted AdaIN: Enhancing the Representation of Local Cues in Image Classifiers

Add code
Oct 09, 2020
Figure 1 for Permuted AdaIN: Enhancing the Representation of Local Cues in Image Classifiers
Figure 2 for Permuted AdaIN: Enhancing the Representation of Local Cues in Image Classifiers
Figure 3 for Permuted AdaIN: Enhancing the Representation of Local Cues in Image Classifiers
Figure 4 for Permuted AdaIN: Enhancing the Representation of Local Cues in Image Classifiers
Viaarxiv icon