Munan Ning

LLMBind: A Unified Modality-Task Integration Framework

Mar 08, 2024
Via arXiv

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Feb 04, 2024
Via arXiv

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

Dec 27, 2023
Via arXiv

Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models

Nov 28, 2023
Via arXiv

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Nov 21, 2023
Via arXiv

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Oct 14, 2023
Via arXiv

Album Storytelling with Iterative Story-aware Captioning and Large Language Models

May 24, 2023
Via arXiv

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

May 24, 2023
Via arXiv

Temporal Contrastive Learning for Spiking Neural Networks

May 23, 2023
Via arXiv

MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation

Jan 18, 2023
Via arXiv