Picture for Ramprasaath R. Selvaraju

Ramprasaath R. Selvaraju

TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation

Add code
Aug 14, 2022
Figure 1 for TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
Figure 2 for TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
Figure 3 for TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
Figure 4 for TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
Viaarxiv icon

Can domain adaptation make object recognition work for everyone?

Add code
Apr 23, 2022
Figure 1 for Can domain adaptation make object recognition work for everyone?
Figure 2 for Can domain adaptation make object recognition work for everyone?
Figure 3 for Can domain adaptation make object recognition work for everyone?
Figure 4 for Can domain adaptation make object recognition work for everyone?
Viaarxiv icon

CLIP-Lite: Information Efficient Visual Representation Learning from Textual Annotations

Add code
Dec 14, 2021
Figure 1 for CLIP-Lite: Information Efficient Visual Representation Learning from Textual Annotations
Figure 2 for CLIP-Lite: Information Efficient Visual Representation Learning from Textual Annotations
Figure 3 for CLIP-Lite: Information Efficient Visual Representation Learning from Textual Annotations
Figure 4 for CLIP-Lite: Information Efficient Visual Representation Learning from Textual Annotations
Viaarxiv icon

PreViTS: Contrastive Pretraining with Video Tracking Supervision

Add code
Dec 01, 2021
Figure 1 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 2 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 3 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 4 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Viaarxiv icon

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Add code
Jul 16, 2021
Figure 1 for Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Figure 2 for Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Figure 3 for Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Figure 4 for Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Viaarxiv icon

CASTing Your Model: Learning to Localize Improves Self-Supervised Representations

Add code
Dec 08, 2020
Figure 1 for CASTing Your Model: Learning to Localize Improves Self-Supervised Representations
Figure 2 for CASTing Your Model: Learning to Localize Improves Self-Supervised Representations
Figure 3 for CASTing Your Model: Learning to Localize Improves Self-Supervised Representations
Figure 4 for CASTing Your Model: Learning to Localize Improves Self-Supervised Representations
Viaarxiv icon

SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency

Add code
Oct 20, 2020
Figure 1 for SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
Figure 2 for SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
Figure 3 for SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
Figure 4 for SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
Viaarxiv icon

SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions

Add code
Jan 20, 2020
Figure 1 for SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions
Figure 2 for SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions
Figure 3 for SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions
Figure 4 for SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions
Viaarxiv icon

Trick or TReAT: Thematic Reinforcement for Artistic Typography

Add code
Mar 19, 2019
Figure 1 for Trick or TReAT: Thematic Reinforcement for Artistic Typography
Figure 2 for Trick or TReAT: Thematic Reinforcement for Artistic Typography
Figure 3 for Trick or TReAT: Thematic Reinforcement for Artistic Typography
Figure 4 for Trick or TReAT: Thematic Reinforcement for Artistic Typography
Viaarxiv icon

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded

Add code
Feb 11, 2019
Figure 1 for Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Figure 2 for Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Figure 3 for Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Figure 4 for Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Viaarxiv icon