Picture for Soham Ghosh

Soham Ghosh

TIPS: Text-Image Pretraining with Spatial Awareness

Add code
Oct 21, 2024
Figure 1 for TIPS: Text-Image Pretraining with Spatial Awareness
Figure 2 for TIPS: Text-Image Pretraining with Spatial Awareness
Figure 3 for TIPS: Text-Image Pretraining with Spatial Awareness
Figure 4 for TIPS: Text-Image Pretraining with Spatial Awareness
Viaarxiv icon

Pixtral 12B

Add code
Oct 09, 2024
Figure 1 for Pixtral 12B
Figure 2 for Pixtral 12B
Figure 3 for Pixtral 12B
Figure 4 for Pixtral 12B
Viaarxiv icon

ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images

Add code
Aug 30, 2024
Figure 1 for ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Figure 2 for ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Figure 3 for ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Figure 4 for ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Viaarxiv icon

Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners

Add code
Dec 09, 2022
Viaarxiv icon

ExCL: Extractive Clip Localization Using Natural Language Descriptions

Add code
Apr 04, 2019
Figure 1 for ExCL: Extractive Clip Localization Using Natural Language Descriptions
Figure 2 for ExCL: Extractive Clip Localization Using Natural Language Descriptions
Figure 3 for ExCL: Extractive Clip Localization Using Natural Language Descriptions
Viaarxiv icon

Concurrent Meta Reinforcement Learning

Add code
Mar 07, 2019
Figure 1 for Concurrent Meta Reinforcement Learning
Figure 2 for Concurrent Meta Reinforcement Learning
Figure 3 for Concurrent Meta Reinforcement Learning
Figure 4 for Concurrent Meta Reinforcement Learning
Viaarxiv icon