Picture for Haoran Xu

Haoran Xu

Temporal Triplane Transformers as Occupancy World Models

Add code
Mar 10, 2025
Viaarxiv icon

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Add code
Mar 03, 2025
Viaarxiv icon

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Add code
Mar 02, 2025
Viaarxiv icon

LEAP: Enhancing Vision-Based Occupancy Networks with Lightweight Spatio-Temporal Correlation

Add code
Feb 21, 2025
Viaarxiv icon

GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models

Add code
Feb 20, 2025
Viaarxiv icon

iMOVE: Instance-Motion-Aware Video Understanding

Add code
Feb 18, 2025
Viaarxiv icon

Towards Efficient Pre-training: Exploring FP4 Precision in Large Language Models

Add code
Feb 17, 2025
Viaarxiv icon

Federated Learning with Sample-level Client Drift Mitigation

Add code
Jan 20, 2025
Viaarxiv icon

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs

Add code
Jan 10, 2025
Figure 1 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 2 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 3 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 4 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Viaarxiv icon

Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment

Add code
Jan 06, 2025
Figure 1 for Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment
Figure 2 for Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment
Figure 3 for Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment
Figure 4 for Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment
Viaarxiv icon