Picture for Hamid Rezatofighi

Hamid Rezatofighi

Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering

Add code
Oct 27, 2024
Viaarxiv icon

TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene

Add code
Sep 26, 2024
Viaarxiv icon

NEUSIS: A Compositional Neuro-Symbolic Framework for Autonomous Perception, Reasoning, and Planning in Complex UAV Search Missions

Add code
Sep 16, 2024
Viaarxiv icon

How Well Can Vision Language Models See Image Details?

Add code
Aug 07, 2024
Viaarxiv icon

DrVideo: Document Retrieval Based Long Video Understanding

Add code
Jun 18, 2024
Figure 1 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 2 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 3 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 4 for DrVideo: Document Retrieval Based Long Video Understanding
Viaarxiv icon

Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning

Add code
Apr 08, 2024
Viaarxiv icon

DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

Add code
Apr 06, 2024
Viaarxiv icon

JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups

Add code
Apr 06, 2024
Viaarxiv icon

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

Add code
Apr 02, 2024
Viaarxiv icon

HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning

Add code
Mar 19, 2024
Viaarxiv icon