Picture for Munan Ning

Munan Ning

LLMBind: A Unified Modality-Task Integration Framework

Add code
Mar 08, 2024
Viaarxiv icon

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Add code
Feb 04, 2024
Figure 1 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Figure 2 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Figure 3 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Figure 4 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Viaarxiv icon

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

Add code
Dec 27, 2023
Figure 1 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Figure 2 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Figure 3 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Figure 4 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Viaarxiv icon

Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models

Add code
Nov 28, 2023
Viaarxiv icon

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Add code
Nov 21, 2023
Viaarxiv icon

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Add code
Oct 14, 2023
Figure 1 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Figure 2 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Figure 3 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Figure 4 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Viaarxiv icon

Album Storytelling with Iterative Story-aware Captioning and Large Language Models

Add code
May 24, 2023
Figure 1 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Figure 2 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Figure 3 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Figure 4 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Viaarxiv icon

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

Add code
May 24, 2023
Viaarxiv icon

Temporal Contrastive Learning for Spiking Neural Networks

Add code
May 23, 2023
Figure 1 for Temporal Contrastive Learning for Spiking Neural Networks
Figure 2 for Temporal Contrastive Learning for Spiking Neural Networks
Figure 3 for Temporal Contrastive Learning for Spiking Neural Networks
Figure 4 for Temporal Contrastive Learning for Spiking Neural Networks
Viaarxiv icon

MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation

Add code
Jan 18, 2023
Viaarxiv icon