Andres Mafla

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

Sep 21, 2022

MUST-VQA: MUltilingual Scene-text VQA

Sep 14, 2022

Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

Mar 16, 2022

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

Oct 06, 2021

Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval

Sep 21, 2020

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features

Jan 14, 2020

ICDAR 2019 Competition on Scene Text Visual Question Answering

Jun 30, 2019

Scene Text Visual Question Answering

May 31, 2019