Ming-Hsuan Yang

From Masks to Worlds: A Hitchhiker's Guide to World Models

Oct 23, 2025

One Flight Over the Gap: A Survey from Perspective to Panoramic Vision

Sep 04, 2025

DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes

Aug 28, 2025

Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding

Aug 27, 2025

Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning

Aug 14, 2025

MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs

Aug 14, 2025

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Aug 07, 2025

Learning Deblurring Texture Prior from Unpaired Data with Diffusion Model

Jul 18, 2025

4KAgent: Agentic Any Image to 4K Super-Resolution

Jul 09, 2025

Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing

Jul 02, 2025