Avinash Ravichandran

VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition

Aug 29, 2024
Via arXiv

InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models

Jul 15, 2024
Via arXiv

GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR

Jun 15, 2024
Via arXiv

Learning Expressive Prompting With Residuals for Vision Transformers

Mar 27, 2023
Via arXiv

WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation

Mar 26, 2023
Via arXiv

Introspective Cross-Attention Probing for Lightweight Transfer of Pre-trained Models

Mar 07, 2023
Via arXiv

A Meta-Learning Approach to Predicting Performance and Data Requirements

Mar 02, 2023
Via arXiv

ComplETR: Reducing the cost of annotations for object detection in dense scenes with vision transformers

Sep 13, 2022
Via arXiv

Semi-supervised Vision Transformers at Scale

Aug 11, 2022
Via arXiv

Masked Vision and Language Modeling for Multi-modal Representation Learning

Aug 03, 2022
Via arXiv