Picture for Mi Zhang

Mi Zhang

Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding

Add code
Nov 15, 2024
Viaarxiv icon

Autoregressive Models in Vision: A Survey

Add code
Nov 08, 2024
Figure 1 for Autoregressive Models in Vision: A Survey
Figure 2 for Autoregressive Models in Vision: A Survey
Figure 3 for Autoregressive Models in Vision: A Survey
Figure 4 for Autoregressive Models in Vision: A Survey
Viaarxiv icon

Artificial Intelligence of Things: A Survey

Add code
Oct 25, 2024
Viaarxiv icon

Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion

Add code
Sep 15, 2024
Figure 1 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Figure 2 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Figure 3 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Figure 4 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Viaarxiv icon

D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

Add code
Jun 14, 2024
Viaarxiv icon

Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images

Add code
Jun 11, 2024
Figure 1 for Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images
Figure 2 for Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images
Figure 3 for Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images
Figure 4 for Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images
Viaarxiv icon

Navigate Beyond Shortcuts: Debiased Learning through the Lens of Neural Collapse

Add code
May 09, 2024
Viaarxiv icon

LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival

Add code
Mar 16, 2024
Viaarxiv icon

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Add code
Mar 15, 2024
Figure 1 for SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Figure 2 for SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Figure 3 for SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Figure 4 for SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Viaarxiv icon