Hehe Fan

Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies

Feb 02, 2026

DRFormer: A Dual-Regularized Bidirectional Transformer for Person Re-identification

Feb 01, 2026

TransNormal: Dense Visual Semantics for Diffusion-based Transparent Object Normal Estimation

Jan 31, 2026

DeMoGen: Towards Decompositional Human Motion Generation with Energy-Based Diffusion Models

Dec 26, 2025

Motion Matters: Motion-guided Modulation Network for Skeleton-based Micro-Action Recognition

Jul 29, 2025

Uni3D-MoE: Scalable Multimodal 3D Scene Understanding via Mixture of Experts

May 27, 2025

TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors

Apr 17, 2025

VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing

Feb 24, 2025

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Feb 05, 2025

BVINet: Unlocking Blind Video Inpainting with Zero Annotations

Feb 03, 2025