Zhenyu Yang

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Mar 12, 2025

X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation

Mar 08, 2025

Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

Mar 06, 2025

Binary Neural Networks for Large Language Model: A Survey

Feb 26, 2025

GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models

Feb 20, 2025

Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation

Feb 06, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Jan 14, 2025

Kwai-STaR: Transform LLMs into State-Transition Reasoners

Nov 07, 2024

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Oct 25, 2024

Spherical Analysis of Learning Nonlinear Functionals

Oct 01, 2024