Picture for Xiaohao Xu

Xiaohao Xu

Robust Latent Matters: Boosting Image Generation with Sampling Error

Add code
Mar 11, 2025
Viaarxiv icon

Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity

Add code
Mar 08, 2025
Viaarxiv icon

Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection

Add code
Mar 06, 2025
Viaarxiv icon

Large Language Models as Natural Selector for Embodied Soft Robot Design

Add code
Mar 04, 2025
Viaarxiv icon

Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video

Add code
Jan 24, 2025
Viaarxiv icon

Hierarchical Multi-Graphs Learning for Robust Group Re-Identification

Add code
Dec 25, 2024
Figure 1 for Hierarchical Multi-Graphs Learning for Robust Group Re-Identification
Figure 2 for Hierarchical Multi-Graphs Learning for Robust Group Re-Identification
Figure 3 for Hierarchical Multi-Graphs Learning for Robust Group Re-Identification
Figure 4 for Hierarchical Multi-Graphs Learning for Robust Group Re-Identification
Viaarxiv icon

Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties

Add code
Dec 19, 2024
Figure 1 for Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties
Figure 2 for Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties
Figure 3 for Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties
Figure 4 for Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties
Viaarxiv icon

MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction

Add code
Dec 12, 2024
Figure 1 for MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction
Figure 2 for MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction
Figure 3 for MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction
Figure 4 for MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction
Viaarxiv icon

Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity

Add code
Dec 09, 2024
Viaarxiv icon

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining

Add code
Nov 23, 2024
Figure 1 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 2 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 3 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 4 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Viaarxiv icon