Picture for Ning Zhang

Ning Zhang

Sid

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Add code
Dec 13, 2024
Viaarxiv icon

Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation

Add code
Dec 03, 2024
Viaarxiv icon

Accelerating Multimodel Large Language Models by Searching Optimal Vision Token Reduction

Add code
Nov 30, 2024
Viaarxiv icon

Sequential LLM Framework for Fashion Recommendation

Add code
Oct 15, 2024
Figure 1 for Sequential LLM Framework for Fashion Recommendation
Figure 2 for Sequential LLM Framework for Fashion Recommendation
Figure 3 for Sequential LLM Framework for Fashion Recommendation
Figure 4 for Sequential LLM Framework for Fashion Recommendation
Viaarxiv icon

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

Add code
Oct 08, 2024
Figure 1 for Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach
Figure 2 for Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach
Figure 3 for Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach
Figure 4 for Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach
Viaarxiv icon

ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue

Add code
Sep 26, 2024
Figure 1 for ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue
Figure 2 for ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue
Figure 3 for ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue
Figure 4 for ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue
Viaarxiv icon

MGSA: Multi-granularity Graph Structure Attention for Knowledge Graph-to-Text Generation

Add code
Sep 16, 2024
Viaarxiv icon

Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs

Add code
Sep 16, 2024
Figure 1 for Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Figure 2 for Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Figure 3 for Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Figure 4 for Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Viaarxiv icon

SoK: Security and Privacy Risks of Medical AI

Add code
Sep 11, 2024
Figure 1 for SoK: Security and Privacy Risks of Medical AI
Figure 2 for SoK: Security and Privacy Risks of Medical AI
Figure 3 for SoK: Security and Privacy Risks of Medical AI
Figure 4 for SoK: Security and Privacy Risks of Medical AI
Viaarxiv icon

Self-Supervised Multi-Scale Network for Blind Image Deblurring via Alternating Optimization

Add code
Sep 02, 2024
Viaarxiv icon