Picture for Junwu Zhang

Junwu Zhang

Open-Sora Plan: Open-Source Large Video Generation Model

Add code
Nov 28, 2024
Figure 1 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 2 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 3 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 4 for Open-Sora Plan: Open-Source Large Video Generation Model
Viaarxiv icon

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Add code
Jul 28, 2024
Viaarxiv icon

Envision3D: One Image to 3D with Anchor Views Interpolation

Add code
Mar 13, 2024
Viaarxiv icon

LLMBind: A Unified Modality-Task Integration Framework

Add code
Mar 08, 2024
Viaarxiv icon

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Add code
Feb 04, 2024
Figure 1 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Figure 2 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Figure 3 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Figure 4 for MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Viaarxiv icon

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

Add code
Dec 27, 2023
Figure 1 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Figure 2 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Figure 3 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Figure 4 for Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Viaarxiv icon

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Add code
Oct 14, 2023
Figure 1 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Figure 2 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Figure 3 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Figure 4 for LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Viaarxiv icon

Learnable Privacy-Preserving Anonymization for Pedestrian Images

Add code
Jul 24, 2022
Figure 1 for Learnable Privacy-Preserving Anonymization for Pedestrian Images
Figure 2 for Learnable Privacy-Preserving Anonymization for Pedestrian Images
Figure 3 for Learnable Privacy-Preserving Anonymization for Pedestrian Images
Figure 4 for Learnable Privacy-Preserving Anonymization for Pedestrian Images
Viaarxiv icon

Learning Periodic Tasks from Human Demonstrations

Add code
Sep 28, 2021
Figure 1 for Learning Periodic Tasks from Human Demonstrations
Figure 2 for Learning Periodic Tasks from Human Demonstrations
Figure 3 for Learning Periodic Tasks from Human Demonstrations
Figure 4 for Learning Periodic Tasks from Human Demonstrations
Viaarxiv icon

A Robot Cluster for Reproducible Research in Dexterous Manipulation

Add code
Sep 22, 2021
Figure 1 for A Robot Cluster for Reproducible Research in Dexterous Manipulation
Figure 2 for A Robot Cluster for Reproducible Research in Dexterous Manipulation
Figure 3 for A Robot Cluster for Reproducible Research in Dexterous Manipulation
Figure 4 for A Robot Cluster for Reproducible Research in Dexterous Manipulation
Viaarxiv icon