Picture for Yi Xin

Yi Xin

TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation

Add code
Mar 10, 2025
Viaarxiv icon

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Add code
Feb 10, 2025
Viaarxiv icon

Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling

Add code
Oct 14, 2024
Figure 1 for Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Figure 2 for Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Figure 3 for Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Figure 4 for Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Viaarxiv icon

Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending

Add code
Sep 17, 2024
Viaarxiv icon

Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective

Add code
Sep 11, 2024
Figure 1 for Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
Figure 2 for Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
Figure 3 for Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
Figure 4 for Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
Viaarxiv icon

Enhancing Test Time Adaptation with Few-shot Guidance

Add code
Sep 02, 2024
Viaarxiv icon

D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Add code
Jun 18, 2024
Figure 1 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 2 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 3 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 4 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Viaarxiv icon

Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model

Add code
May 24, 2024
Viaarxiv icon

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

Add code
May 23, 2024
Figure 1 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Figure 2 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Figure 3 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Figure 4 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Viaarxiv icon

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey

Add code
Feb 08, 2024
Viaarxiv icon