Picture for Anand Mishra

Anand Mishra

Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant

Add code
Oct 24, 2024
Viaarxiv icon

Sketch-guided Image Inpainting with Partial Discrete Diffusion Process

Add code
Apr 18, 2024
Viaarxiv icon

Towards Scene-Text to Scene-Text Translation

Add code
Aug 06, 2023
Viaarxiv icon

Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering

Add code
Jun 29, 2023
Viaarxiv icon

Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch

Add code
Mar 15, 2023
Viaarxiv icon

Multimodal Query-guided Object Localization

Add code
Dec 01, 2022
Viaarxiv icon

Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

Add code
Nov 23, 2022
Viaarxiv icon

Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification

Add code
Nov 23, 2022
Figure 1 for Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification
Figure 2 for Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification
Figure 3 for Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification
Figure 4 for Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification
Viaarxiv icon

Grounding Scene Graphs on Natural Images via Visio-Lingual Message Passing

Add code
Nov 03, 2022
Viaarxiv icon

COFAR: Commonsense and Factual Reasoning in Image Search

Add code
Oct 16, 2022
Figure 1 for COFAR: Commonsense and Factual Reasoning in Image Search
Figure 2 for COFAR: Commonsense and Factual Reasoning in Image Search
Figure 3 for COFAR: Commonsense and Factual Reasoning in Image Search
Figure 4 for COFAR: Commonsense and Factual Reasoning in Image Search
Viaarxiv icon