Picture for Ali Furkan Biten

Ali Furkan Biten

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

Add code
Sep 21, 2022
Figure 1 for Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia
Figure 2 for Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia
Figure 3 for Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia
Figure 4 for Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia
Viaarxiv icon

MUST-VQA: MUltilingual Scene-text VQA

Add code
Sep 14, 2022
Figure 1 for MUST-VQA: MUltilingual Scene-text VQA
Figure 2 for MUST-VQA: MUltilingual Scene-text VQA
Figure 3 for MUST-VQA: MUltilingual Scene-text VQA
Figure 4 for MUST-VQA: MUltilingual Scene-text VQA
Viaarxiv icon

Out-of-Vocabulary Challenge Report

Add code
Sep 14, 2022
Figure 1 for Out-of-Vocabulary Challenge Report
Figure 2 for Out-of-Vocabulary Challenge Report
Figure 3 for Out-of-Vocabulary Challenge Report
Figure 4 for Out-of-Vocabulary Challenge Report
Viaarxiv icon

Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

Add code
Mar 16, 2022
Figure 1 for Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement
Figure 2 for Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement
Figure 3 for Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement
Figure 4 for Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement
Viaarxiv icon

OCR-IDL: OCR Annotations for Industry Document Library Dataset

Add code
Feb 25, 2022
Figure 1 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Figure 2 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Figure 3 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Figure 4 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Viaarxiv icon

LaTr: Layout-Aware Transformer for Scene-Text VQA

Add code
Dec 24, 2021
Figure 1 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 2 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 3 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 4 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Viaarxiv icon

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

Add code
Oct 06, 2021
Figure 1 for Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Figure 2 for Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Figure 3 for Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Figure 4 for Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Viaarxiv icon

Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning

Add code
Oct 04, 2021
Figure 1 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Figure 2 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Figure 3 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Figure 4 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Viaarxiv icon

Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild

Add code
Sep 24, 2021
Figure 1 for Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild
Figure 2 for Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild
Figure 3 for Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild
Figure 4 for Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild
Viaarxiv icon

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

Add code
May 11, 2021
Figure 1 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Figure 2 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Figure 3 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Figure 4 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Viaarxiv icon