Picture for Yuhao Chen

Yuhao Chen

Bridging the Discrete-Continuous Gap: Unified Multimodal Generation via Coupled Manifold Discrete Absorbing Diffusion

Add code
Jan 07, 2026
Viaarxiv icon

Stable Language Guidance for Vision-Language-Action Models

Add code
Jan 07, 2026
Viaarxiv icon

Specific Multi-emitter Identification: Theoretical Limits and Low-complexity Design

Add code
Dec 22, 2025
Viaarxiv icon

Avatar4D: Synthesizing Domain-Specific 4D Humans for Real-World Pose Estimation

Add code
Dec 18, 2025
Viaarxiv icon

Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution

Add code
Dec 13, 2025
Figure 1 for Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution
Figure 2 for Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution
Figure 3 for Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution
Figure 4 for Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution
Viaarxiv icon

Food Image Generation on Multi-Noun Categories

Add code
Dec 09, 2025
Viaarxiv icon

Look As You Think: Unifying Reasoning and Visual Evidence Attribution for Verifiable Document RAG via Reinforcement Learning

Add code
Nov 15, 2025
Viaarxiv icon

MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding

Add code
Nov 13, 2025
Figure 1 for MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding
Figure 2 for MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding
Figure 3 for MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding
Figure 4 for MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding
Viaarxiv icon

SCALEX: Scalable Concept and Latent Exploration for Diffusion Models

Add code
Nov 13, 2025
Viaarxiv icon

DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation

Add code
Nov 12, 2025
Viaarxiv icon