Mi Zhang

Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization

Mar 19, 2025
Via arXiv

SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression

Mar 16, 2025
Via arXiv

Revisiting Backdoor Attacks on Time Series Classification in the Frequency Domain

Mar 12, 2025
Via arXiv

MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference

Feb 24, 2025
Via arXiv

Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink

Jan 25, 2025
Via arXiv

Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding

Nov 15, 2024
Via arXiv

Autoregressive Models in Vision: A Survey

Nov 08, 2024
Via arXiv

Artificial Intelligence of Things: A Survey

Oct 25, 2024
Via arXiv

Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion

Sep 15, 2024
Via arXiv

D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Jun 18, 2024
Via arXiv