Hanyu Wei

AKVQ-VL: Attention-Aware KV Cache Adaptive 2-Bit Quantization for Vision-Language Models

Jan 25, 2025
Via arXiv

Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation

Oct 18, 2022
Via arXiv