Picture for Anoop Cherian

Anoop Cherian

ComplexVAD: Detecting Interaction Anomalies in Video

Add code
Jan 16, 2025
Viaarxiv icon

SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera

Add code
Dec 22, 2024
Viaarxiv icon

Manual-PA: Learning 3D Part Assembly from Instruction Diagrams

Add code
Nov 27, 2024
Viaarxiv icon

LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models

Add code
Nov 12, 2024
Viaarxiv icon

Disentangled Acoustic Fields For Multimodal Physical Scene Understanding

Add code
Jul 16, 2024
Figure 1 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 2 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 3 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 4 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Viaarxiv icon

Temporally Grounding Instructional Diagrams in Unconstrained Videos

Add code
Jul 16, 2024
Figure 1 for Temporally Grounding Instructional Diagrams in Unconstrained Videos
Figure 2 for Temporally Grounding Instructional Diagrams in Unconstrained Videos
Figure 3 for Temporally Grounding Instructional Diagrams in Unconstrained Videos
Figure 4 for Temporally Grounding Instructional Diagrams in Unconstrained Videos
Viaarxiv icon

Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

Add code
Jun 22, 2024
Viaarxiv icon

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Add code
Apr 25, 2024
Viaarxiv icon

Multi-level Reasoning for Robotic Assembly: From Sequence Inference to Contact Selection

Add code
Dec 17, 2023
Viaarxiv icon

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Add code
Sep 30, 2023
Viaarxiv icon