Shivam Chandhok

MM-R$^3$: On (In-)Consistency of Multi-modal Large Language Models (MLLMs)

Oct 07, 2024
Via arXiv

Response Wide Shut: Surprising Observations in Basic Vision Language Model Capabilities

Aug 13, 2024
Via arXiv

SceneGPT: A Language Model for 3D Scene Understanding

Aug 13, 2024
Via arXiv

Do Vision-Language Foundational models show Robust Visual Perception?

Aug 13, 2024
Via arXiv

Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving

Oct 03, 2023
Via arXiv

Empirical Optimal Transport between Conditional Distributions

May 25, 2023
Via arXiv

Hardware Software Co-design of Statistical and Deep Learning Frameworks for Wideband Sensing on Zynq System on Chip

Sep 06, 2022
Via arXiv

INDIGO: Intrinsic Multimodality for Domain Generalization

Jun 13, 2022
Via arXiv

Unseen Classes at a Later Time? No Problem

Mar 30, 2022
Via arXiv

Resource Constrained Neural Networks for 5G Direction-of-Arrival Estimation in Micro-controllers

Jul 23, 2021
Via arXiv