Anoop Cherian

ComplexVAD: Detecting Interaction Anomalies in Video

Jan 16, 2025

SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera

Dec 22, 2024

Manual-PA: Learning 3D Part Assembly from Instruction Diagrams

Nov 27, 2024

LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models

Nov 12, 2024

Temporally Grounding Instructional Diagrams in Unconstrained Videos

Jul 16, 2024

Disentangled Acoustic Fields For Multimodal Physical Scene Understanding

Jul 16, 2024

Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

Jun 22, 2024

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Apr 25, 2024

Multi-level Reasoning for Robotic Assembly: From Sequence Inference to Contact Selection

Dec 17, 2023

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Sep 30, 2023