Picture for Hengyu Fu

Hengyu Fu

Plan, Verify and Fill: A Structured Parallel Decoding Approach for Diffusion Language Models

Add code
Jan 18, 2026
Viaarxiv icon

Neural Networks Learn Generic Multi-Index Models Near Information-Theoretic Limit

Add code
Nov 19, 2025
Viaarxiv icon

Learning Hierarchical Polynomials of Multiple Nonlinear Features with Three-Layer Networks

Add code
Nov 26, 2024
Viaarxiv icon

Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data

Add code
Jul 23, 2024
Figure 1 for Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data
Figure 2 for Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data
Figure 3 for Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data
Figure 4 for Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data
Viaarxiv icon

Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory

Add code
Mar 18, 2024
Viaarxiv icon

What can a Single Attention Layer Learn? A Study Through the Random Features Lens

Add code
Jul 21, 2023
Viaarxiv icon