Ju He

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Apr 06, 2026

Autoregressive Image Generation with Masked Bit Modeling

Feb 09, 2026

Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers

May 20, 2025

ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction

Apr 30, 2025

FlowTok: Flowing Seamlessly Across Text and Image Tokens

Mar 13, 2025

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Feb 27, 2025

Dictionary-based Framework for Interpretable and Consistent Object Parsing

Feb 26, 2025

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Jan 13, 2025

FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching

Dec 19, 2024

Randomized Autoregressive Visual Generation

Nov 01, 2024