Picture for Shuai Yang

Shuai Yang

Towards Efficient Agents: A Co-Design of Inference Architecture and System

Add code
Dec 20, 2025
Viaarxiv icon

Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers

Add code
Dec 18, 2025
Figure 1 for Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Figure 2 for Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Figure 3 for Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Figure 4 for Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Viaarxiv icon

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Add code
Dec 16, 2025
Viaarxiv icon

LongLive: Real-time Interactive Long Video Generation

Add code
Sep 26, 2025
Viaarxiv icon

LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors

Add code
Sep 09, 2025
Viaarxiv icon

ANYPORTAL: Zero-Shot Consistent Video Background Replacement

Add code
Sep 09, 2025
Figure 1 for ANYPORTAL: Zero-Shot Consistent Video Background Replacement
Figure 2 for ANYPORTAL: Zero-Shot Consistent Video Background Replacement
Figure 3 for ANYPORTAL: Zero-Shot Consistent Video Background Replacement
Figure 4 for ANYPORTAL: Zero-Shot Consistent Video Background Replacement
Viaarxiv icon

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

Add code
Aug 14, 2025
Viaarxiv icon

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Add code
Jul 23, 2025
Viaarxiv icon

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

Add code
Jun 24, 2025
Figure 1 for CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation
Figure 2 for CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation
Figure 3 for CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation
Figure 4 for CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation
Viaarxiv icon

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Add code
Jun 12, 2025
Figure 1 for GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
Figure 2 for GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
Figure 3 for GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
Figure 4 for GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
Viaarxiv icon