Picture for Zhizhou Sha

Zhizhou Sha

The Path Not Taken: RLVR Provably Learns Off the Principals

Add code
Nov 11, 2025
Viaarxiv icon

Discriminator-Free Direct Preference Optimization for Video Diffusion

Add code
Apr 11, 2025
Viaarxiv icon

HOFAR: High-Order Augmentation of Flow Autoregressive Transformers

Add code
Mar 11, 2025
Figure 1 for HOFAR: High-Order Augmentation of Flow Autoregressive Transformers
Figure 2 for HOFAR: High-Order Augmentation of Flow Autoregressive Transformers
Figure 3 for HOFAR: High-Order Augmentation of Flow Autoregressive Transformers
Figure 4 for HOFAR: High-Order Augmentation of Flow Autoregressive Transformers
Viaarxiv icon

On Computational Limits of FlowAR Models: Expressivity and Efficiency

Add code
Feb 23, 2025
Viaarxiv icon

Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling

Add code
Feb 12, 2025
Figure 1 for Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling
Figure 2 for Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling
Figure 3 for Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling
Figure 4 for Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling
Viaarxiv icon

RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation

Add code
Jan 17, 2025
Viaarxiv icon

On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis

Add code
Jan 08, 2025
Figure 1 for On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis
Viaarxiv icon

HSR-Enhanced Sparse Attention Acceleration

Add code
Oct 14, 2024
Figure 1 for HSR-Enhanced Sparse Attention Acceleration
Figure 2 for HSR-Enhanced Sparse Attention Acceleration
Viaarxiv icon

Looped ReLU MLPs May Be All You Need as Practical Programmable Computers

Add code
Oct 12, 2024
Viaarxiv icon

Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time

Add code
Aug 23, 2024
Viaarxiv icon