Picture for Xiaohao Xu

Xiaohao Xu

MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction

Add code
Dec 12, 2024
Viaarxiv icon

Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity

Add code
Dec 09, 2024
Viaarxiv icon

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining

Add code
Nov 23, 2024
Figure 1 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 2 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 3 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 4 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Viaarxiv icon

From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking

Add code
Jun 24, 2024
Figure 1 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Figure 2 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Figure 3 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Figure 4 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Viaarxiv icon

Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM

Add code
Jun 18, 2024
Viaarxiv icon

LogiCode: an LLM-Driven Framework for Logical Anomaly Detection

Add code
Jun 07, 2024
Viaarxiv icon

Self-supervised Pre-training for Transferable Multi-modal Perception

Add code
May 28, 2024
Viaarxiv icon

Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions

Add code
Mar 25, 2024
Figure 1 for Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions
Figure 2 for Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions
Figure 3 for Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions
Figure 4 for Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions
Viaarxiv icon

Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning

Add code
Mar 17, 2024
Figure 1 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Figure 2 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Figure 3 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Figure 4 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Viaarxiv icon

GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection

Add code
Mar 12, 2024
Viaarxiv icon